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Chapter 1 

Introduction 



The title page of this book contains a graphic that we reproduce here. 




It is intended to evoke thoughts of the scientific method. 

A hypothesis analyzed by a person produces a prediction, which motivates the 
specification of an experiment, which when executed results in an observation, 
which analyzed by a person yields a hypothesis. 

This sounds valid, and a good graphic can be exceptionally useful for leading a reader 
through the story that the author wishes to tell. 

Interestingly, a graphic has the power to evoke feelings of understanding, without 
really meaning much. The same is true for text: it is possible to use a language such as 
English to express ideas that are never made rigorous or clear. When someone says "I 
believe in free will," what does she believe in? We may all have some concept of what 
she's saying — something we can conceptually work with and discuss or argue about. But 
to what extent are we all discussing the same thing, the thing she intended to convey? 

Science is about agreement. When we supply a convincing argument, the result 
of this convincing is agreement. When in an experiment the observation matches the 
hypothesis — success! — that is agreement. When my methods make sense to you, that is 
agreement. When practice does not agree with theory, that is disagreement. Agreement 
is the good stuff in science; it's the high fives. 
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But it is easy to think we're in agreement, when reahy we're not. Modehng our 
thoughts on heuristics and pictures may be convenient for quick travel down the road, 
but we're liable to miss our turnoff at the first mile. The danger is in mistaking our 
convenient conceptualizations for what's actually there. It is imperative that we have 
the ability at any time to ground out in reality. What does that mean? 

Data. Hard evidence. The physical world. It is here that science touches down and 
heuristics evaporate. So let's look again at the diagram on the cover. It is intended 
to evoke an idea of how science is performed. Is there hard evidence and data to back 
this theory up? Can we set up an experiment to find out whether science is actually 
performed according to such a protocol? To do so we have to shake off the stupor evoked 
by the diagram and ask the question: "what does this diagram intend to communicate?" 

In this course I will use a mathematical tool called ologs^ or ontology logs, to give 
some structure to the kinds of ideas that are often communicated in pictures like the 
one on the cover. Each olog inherently offers a framework in which to record data about 
the subject. More precisely it encompasses a database schema^ which means a system of 
interconnected tables that are initially empty but into which data can be entered. For 
example consider the olog below 



a mass 



has 



as mass 



an object of mass 
m held at height h 
above the ground 



has as height 
in meters 



a real number h 



when dropped has 
as number of sec- 
onds tin hitting the 
ground 




a real number 



This olog represents a framework in which to record data about objects held above the 
ground, their mass, their height, and a comparison (the ?-mark in the middle) between 
the number of seconds till they hit the ground and a certain real- valued function of their 
height. We will discuss ologs in detail throughout this course. 

The picture in (1.1) looks like an olog, but it does not conform to the rules that 
we lay out for ologs in Section 2.3. In an olog, every arrow is intended to represent 
a mathematical function. It is difficult to imagine a function that takes in predictions 
and outputs experiments, but such a function is necessary in order for (1.1) to make 
sense. To produce an experiment design from a prediction probably requires an expert, 
and even then the expert may be motivated to specify a different experiment on Tuesday 
than he is on Monday. But perhaps our criticism has led to a way forward: if we say that 
every arrow represents a function when in the context of a specific expert who is actually 
doing the science at a specific time^ then figure 1.1 begins to make sense. In fact, we 
will return to the figure in Section 5.3 (specifically Example 5.3.3.9), where background 
methodological context is discussed in earnest. 

This course is an attempt to extol the virtues of a new branch of mathematics, 
called category theory^ which was invented for powerful communication of ideas between 
different fields and subfields within mathematics. By powerful communication of ideas I 
actually mean something precise. Different branches of mathematics can be formalized 
into categories. These categories can then be connected together by functors. And the 
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sense in which these functors provide powerful communication of ideas is that facts and 
theorems proven in one category can be transferred through a connecting functor to 
yield proofs of an analogous theorem in another category. A functor is like a conductor 
of mathematical truth. 

I believe that the language and toolset of category theory can be useful throughout 
science. We build scientific understanding by developing models, and category theory is 
the study of basic conceptual building blocks and how they cleanly fit together to make 
such models. Certain structures and conceptual frameworks show up again and again in 
our understanding of reality. No one would dispute that vector spaces are ubiquitous. 
But so are hierarchies, symmetries, actions of agents on objects, data models, global 
behavior emerging as the aggregate of local behavior, self-similarity, and the effect of 
methodological context. 

Some ideas are so common that our use of them goes virtually undetected, such as set- 
theoretic intersections. For example, when we speak of a material that is both lightweight 
and ductile, we are intersecting two sets. But what is the use of even mentioning this 
fact? The answer is that when we formalize our ideas, our understanding is almost 
always clarified. Our ability to communicate with others is enhanced, and the possibility 
for developing new insights expands. And if we are ever to get to the point that we can 
input our ideas into computers, we will need to be able to formalize these ideas first. 

It is my hope that this course will offer scientists a new vocabulary in which to 
think and communicate, and a new pipeline to the vast array of theorems that exist and 
are considered immensely powerful within mathematics. These theorems have not made 
their way out into the world of science, but they are directly applicable there. Hierarchies 
are posets, symmetries are group elements, data models are categories, agent actions are 
monoid actions, local-global principles are sheaves, self-similarity is modeled by operads, 
context can be modeled by monads. 

1.1 A brief history of category theory 

The paradigm shift brought on by Einstein's theory of relativity brought on the real- 
ization that there is no single perspective from which to view the world. There is no 
background framework that we need to find; there are infinitely many different frame- 
works and perspectives, and the real power lies in being able to translate between them. 
It is in this historical context that category theory got its start. ^ 

Category theory was invented in the early 1940s by Samuel Eilenberg and Saunders 
Mac Lane. It was specifically designed to bridge what may appear to be two quite 
different fields: topology and algebra. Topology is the study of abstract shapes such as 
7-dimensional spheres; algebra is the study of abstract equations such diS y'^z = — xz^. 
People had already created important and useful links (e.g. cohomology theory) between 
these fields, but Eilenberg and Mac Lane needed to precisely compare different links with 
one another. To do so they first needed to boil down and extract the fundamental nature 
of these two fields. But the result was a structure that fit not only topology and algebra, 
but many other mathematical disciplines as well. 

At first category theory was little more than a deeply clarifying language for existing 
difficult mathematical ideas. However, in 1957 Alexander Grothendieck used category 

■"^The following history of category theory is far too brief, and it may leave out important contributions 
or credit some with what should more aptly be credited to others. If you happen to know of such an 
error, please contact the author. 
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theory to build new mathematical machinery (new cohomology theories) that granted 
unprecedented insight into the behavior of algebraic equations. Since that time, cat- 
egories have been built specifically to zoom in on particular features of mathematical 
subjects and study them with a level of acuity that is simply unavailable elsewhere. 

Bill Lawvere saw category theory as a new foundation for all mathematical thought. 
Mathematicians had been searching for foundations in the 19th century and were reason- 
ably satisfied with set theory as the foundation. But Lawvere showed that the category 
of sets is simply a category with certain nice properties, not necessarily the center of 
the mathematical universe. He explained how whole algebraic theories can be viewed as 
examples of a single system. He and others went on to show that logic was beautifully 
captured in the setting of category theory (or more specifically toposes). It is here also 
that Grothendieck and his school worked out major results in algebraic geometry. 

In 1980 Joachim Lambek showed that the types and programs used in computer 
science form a specific kind of category. This provided a new semantics for talking about 
programs, allowing people to investigate how programs combine and compose to create 
other programs, without caring about the specifics of implementation. Eugenio Moggi 
brought the category theoretic notion of monads into computer science to encapsulate 
ideas that up to that point were considered outside the realm of such theory. 

It is difficult to explain the clarity and beauty brought to category theory by people 
like Daniel Kan and Andre Joyal. They have each repeatedly extracted the essence of a 
whole mathematical subject to reveal and formalize a stunningly simple yet extremely 
powerful pattern of thinking, revolutionizing how mathematics is done. 

All this time, however, category theory was consistently seen by much of the mathe- 
matical community as ridiculously abstract. But in the 21st century it has finally come 
to find healthy respect within the larger community of pure mathematics. It is the lan- 
guage of choice for graduate- level algebra and topology courses, and in my opinion will 
continue to establish itself as the basic framework in which mathematics is done. 

As mentioned above category theory has branched out into certain areas of science 
as well. Baez and Dolan have shown its value in making sense of quantum physics, it 
is well established in computer science, and it has found proponents in several other 
fields as well. But to my mind, we are the very beginning of its venture into scientific 
methodology. Category theory was invented as a bridge and it will continue to serve in 
that role. 

1.2 Intention of this book 

The world of applied mathematics is much smaller than the world of applicable math- 
ematics. This course is intended to create a bridge to the vast array of mathematical 
concepts that are used daily by mathematicians to describe all manner of phenomena 
that arise in our studies, and which I have found naturally arise in thinking about physics, 
computation, and neuroscience. 

To the pure mathematician I'll try to prove that concepts such as categories, func- 
tors, natural transformations, limits, colimits, functor categories, sheaves, monads, and 
operads — concepts that are often considered too abstract for even math majors — can 
be communicated to scientists with no math background beyond linear algebra. If this 
material is as teachable as I think, it means that category theory is not esoteric but 
somehow well- aligned with ideas that already make sense to the scientific mind. Note, 
however, that this book is example-based rather than proof-based, so it may not be 
suitable as a reference for students of pure mathematics. 
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To the scientist I'll try to prove the claim that category theory includes a formal 
treatment of conceptual structures that the scientist sees often perhaps without realizing 
that there is well-oiled mathematical machinery to be employed. We will work on the 
structure of information; how data is made meaningful by its connections, both internal 
and out reaching, to other data. Note, however, that this book should most certainly not 
be taken as a reference on scientific disciplines. One should assume that any account of 
physics, materials science, chemistry, etc. has been oversimplified. The intention is to 
give a flavor of how category theory may help us model sclent iflc ideas, not to explain 
these ideas in a serious way. 

Data gathering is ubiquitous in science. Giant databases are currently being mined 
for unknown patterns, but in fact there are many (many) known patterns that simply 
have not been catalogued. Consider the well-known case of medical records. A patient's 
medical history is often known by various individual doctor-ofiices but quite inadequately 
shared between them. Sharing medical records often means faxing a hand-written note 
or a flUed-in house-created form between oflices. 

Similarly, in science there exists substantial expertise making brilliant connections 
between concepts, but it is being conveyed in silos of English prose known as journals 
articles. Every paper has a methods section, but it is almost impossible to read a methods 
section and subsequently repeat the experiment — the English is inadequate to concisely 
convey what is being done. 

The first thing to understand is that reusable methodologies can be formalized, and 
that doing so is inherently valuable. Consider the following analogy. Suppose you want 
to add up the area of a region in space (or the area under a curve). You break the region 
down into small squares, each of which you know has area A; then you count the number 
of squares, say n, and the result is that the region has an area of about nA. If you want 
a more precise and accurate result you repeat the process with half-size squares. This 
methodology can be used for any area-finding problem (of which there are more than a 
first-year calculus student generally realizes) and thus it deserves to be formalized. But 
once we have formalized this methodology, it can be taken to its limit and out comes 
integration by Riemann sums. 

I intend to show that category theory is incredibly eflicient as a language for ex- 
perimental design patterns, introducing formality while remaining flexible. It forms a 
rich and tightly woven conceptual fabric that allows the scientist to maneuver between 
different perspectives whenever the need arises. Once one builds that fabric for oneself, 
he or she has an ability to think about models in a way that simply would not occur 
without it. Moreover, putting ideas into the language of category theory forces a person 
to clarify their assumptions. This is highly valuable both for the researcher and for his 
or her audience. 

What must be recognized in order to flnd value in this course is that conceptual chaos 
is a major problem. Creativity demands clarity of thinking, and to think clearly about 
a subject requires an organized understanding of how it flts together. Organization and 
clarity also lead to better communication with others. Academics often say they are 
paid to think and understand, but that is not true. They are paid to think, understand, 
and communicate their findings. Universal languages for science — languages such as 
calculus and differential equations, matrices, or simply graphs and pie-charts — grant us 
a cohesiveness that makes scientific research worthwhile. In this book I will attempt 
to show that category theory can be similarly useful in describing complex scientific 
understandings. 
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1.3 What is requested from the student 

I will do my best to make clear the value of category theory in science, but I am not a 
scientist. To that end I am asking for your help in exploring how category theory may 
be useful in your specific field. 

I also want you to recognize that the value of mathematics is not generally obvious 
at first. A good student learning a good subject with a good teacher will see something 
compelling almost immediately, but may not see how it will be useful in real life. This 
will come later. I hope you will work hard to understand even without yet knowing what 
its actual value in your life and research will be. Like a student of soccer is encouraged 
to spend hours juggling the ball when he or she could be practicing penalty shots, it 
is important to gain facility with the materials you will be using. Doing exercises is 
imperative for learning mathematics. 

1.4 Category theory references 

I wrote this book because the available books on category theory are almost all written 
for mathematicians (the rest are written for computer scientists). There is one book by 
Lawvere and Schanuel, called Conceptual Mathematics [LS], that offers category theory 
to a wider audience, but its style is not appropriate for this course. Still, it is very well 
written and clear. 

The "bible" of category theory is Categories for the working mathematician by Mac 
Lane [Mac]. But as the title suggests, it was written for working mathematicians and 
will be quite opaque to my target audience. However, once a person has read this book, 
Mac Lane's book may become a valuable reference. 

Other good books include Steve Awodey's book Category theory [Awo] and Barr and 
Wells book Category theory for computing science, [B W] . And online I find wikipedia 
and a site called the nlab http : //neat lab . org/nlab/ show/HomePage to be quite useful. 

This book attempts to explain category theory by example and exercise rather than 
by theorems and proofs. I hope this approach will be valuable to the working scientist. 
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Chapter 2 

The category of sets 



The theory of sets was invented as a foundation for all of mathematics. The notion of 
sets and functions serves as a basis on which to build our intuition about categories in 
general. In this chapter we will give examples of sets and functions and then move on 
to discuss commutative diagrams. At this point we can introduce ologs which will allow 
us to use the language of category theory to speak about real world concepts. Then we 
will introduce limits and colimits, and their universal properties. All of this material is 
basic set theory, but it can also be taken as an investigation of our first category, the 
category of set which we call Set. We will end this chapter with some other interesting 
constructions in Set that do not fit into the previous sections. 

2.1 Sets and functions 
2.1.1 Sets 

In this course I'll assume you know what a set is. We can think of a set X as a collection 
of things X E X, each of which is recognizable as being in X and such that for each pair 
of named elements x^x^ e X we can tell if x = x^ or not. ^ The set of pendulums is the 
collection of things we agree to call pendulums, each of which is recognizable as being a 
pendulum, and for any two people pointing at pendulums we can tell if they're pointing 
at the same pendulum or not. 

Notation 2.1.1.1. The symbol denotes the set with no elements. The symbol N 
denotes the set of natural numbers, which we can write as 

N:= {0,1,2, 3, 4,..., 877,...}. 

The symbol Z denotes the set of integers, which contains both the natural numbers and 
their negatives, 

Z:= {...,-551,. ..,-2,-1,0, 1,2,...}. 

If A and B are sets, we say that A is a subset of 5, and write A ^ 5, if every element 
of A is an element of B. So we have M ^ Z. Checking the definition, one sees that 

■"^Note that the symbol x' , read "x-prime", has nothing to do with calculus or derivatives. It is simply 
notation that we use to name a symbol that is suggested as being somehow like x. This suggestion 
of kinship between x and x' is meant only as an aid for human cognition, and not as part of the 
mathematics. 
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X 




A 





Figure 2.1: A set X with 9 elements and a set Y with no elements, Y = 0. 



for any set A, we have (perhaps uninteresting) subsets ^ A and A ^ A. We can 
use set-builder notation to denote subsets. For example the set of even integers can be 
written {n e Z | n is even}. The set of integers greater than 2 can be written in many 
ways, such as 

{n e Z I n > 2} or {n e N | n > 2} or {n e N | n ^ 3}. 



The symbol 3 means "there exists". So we could write the set of even integers as 



The symbol 3! means "there exists a unique". So the statement "3!x e R such that = 
0" means that there is one and only one number whose square is 0. Finally, the symbol 
V means "for all". So the statement "Vm e N 3n e N such that m < n" means that for 
every number there is a bigger one. 



Exercise 2.1.1.2. Let A = {1,2,3}. What are ah the subsets of A7 Hint: there are 8. 



2.1.2 Functions 



{n e Z I n is even} 



{n G Z I 3m e Z such that 2m = n}. 



If X and Y are sets, then a function f from X to F, denoted / : X ^ F, is a mapping 
that sends each element x e X to an element of F, denoted f{x) e Y. We call X the 
domain of the function / and we call Y the codomain of /. 
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X Y 




(2.2) 



Note that for every element x e there is exactly one arrow emanating from 
but for an element y ^ there can be several arrows pointing to or there can be no 
arrows pointing to y. 

Application 2.1.2.1. In studying the mechanics of materials, one wishes to know how a 
material responds to tension. For example a rubber band responds to tension differently 
than a spring does. To each material we can associate a force-extension curve, recording 
how much the force the material carries when extended to various lengths. Once we 
fix a methodology for performing experiments, finding a material's force-extension curve 
would ideally constitute a function from the set of materials to the set of curves. ^ 

00 

Exercise 2.1.2.2. Here is a simplified account of how the brain receives light. The eye 
contains about 100 million photoreceptor (PR) cells. Each connects to a retinal ganglion 
(RG) cell. No PR cell connects to two different RG cells, but usually many PR cells can 
attach to a single RG cell. 

Let PR denote the set of photoreceptor cells and let RG denote the set of retinal 
ganglion cells. 

a. ) According to the above account, does the connection pattern constitute a function 

RG PR, a function PR RG or neither one? 

b. ) Would you guess that the connection pattern that exists between other areas of the 

brain are "function-like"? 



Example 2.1.2.3. Suppose that X is a set and X' ^ X is a subset. Then we can consider 
the function X' ^ X given by sending every element of X' to "itself" as an element of 
X. For example if X = {a, 6, c, e, /} and X' = {6, e} then X' ^ X and we turn that 
into the function X ^ X' given hy b ^ d ^ e ^ e. 

As a matter of notation, we may sometimes say something like the following: Let X 
be a set and let i : X' ^ X be a subset. The good thing about this is that given a function 
/: X ^ F, we can compose it with i: X^ ^ X and get a function f oi: X' . This 
is often called the restriction of f to X' , and we have obtained it simply by function 
composition. 

^In reality, different samples of the same material, say samples of different sizes or at different 
temperatures, may have different force-extension curves. If we want to see this as a true function whose 
codomain is curves it should have as domain something like the set of material samples. 
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Given a function f : X ^ Y^ the elements of Y that have at least one arrow pointing 
to them are said to be in the image of /; that is we have 

im(/) := {yeY \ 3xe X such that f{x) = y}. (2.3) 

Exercise 2.1.2 A. If /: X ^ F is depicted by (2.2) above, write its image, im(/) as a 
set. 

Given a function f : X ^ Y and a function g: Y ^ where the codomain of / is 
the same set as the domain of g (namely F), we say that / and g are composable 

X — ^ Y Z. 

The composition of f and g is denoted by ^ o / : X ^ Z. 



X Y z 




Figure 2.4: Functions f: X ^ Y and g: Y ^ Z compose to a function g o f : X ^ Z; 
just follow the arrows. 

Let X and Y be sets. We write Homset(-^, ^) to denote the set of functions X ^Y. 
^ Note that two functions f^g:X^Y are equal if and only if for every element x e X 
we have f{x) = g{x). 

Exercise 2.1.2.5. Let A = {1, 2, 3, 4, 5} and B = {x, y}. 

a. ) How many elements does Homset(^7^) have? 

b. ) How many elements does Homset(^,^) have? 



Exercise 2.1.2.6. a.) Find a set A such that for all sets X there is exactly one element 
in Homset (-^7 ^) • Hint: draw a picture of proposed A's and X's. 

b.) Find a set B such that for all sets X there is exactly one element in Homset(^, -^)- 



For any set X, we define the identity function on X, denoted idx : X ^ X, to be 
the function such that for all x G X we have idx{x) = x. 

^This strange notation will make more sense later. 
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Definition 2.1.2.7. Let X and Y be sets. A function f : X ^ Y is called an isomor- 
phism^ denoted / : X ^ F, if there exists a function g: Y ^ X such that g o f = idx 
and f o g = idy . We also say that / is invertible and we say that g is the inverse of /. If 
there exists an isomorphism X ^ Y we say that X and Y are isomorphic sets and may 
write X ^Y. 

Example 2.1.2.8. If X and Y are sets and f : X y is an isomorphism then the 
analogue of Diagram 2.2 will look like a perfect matching, more often called a one-to- 
one correspondence. That means that no two arrows will hit the same element of F, 
and every element of Y will be in the image. For example, the following depicts an 
isomorphism X ^ Y. 



X Y 




Application 2.1.2.9. There is an isomorphism between the set Nucdna of nucleotides 
found in DNA and the set Nucrna of nucleotides found in RNA. Indeed both sets have 
four elements, so there are 24 different isomorphisms. But only one is useful. Before we 
say which one it is, let us say there is also an isomorphism Nucdna = {A^ C, G, T} and 
an isomorphism Nucrna = {A^ C^G^U}, and we will use the letters as abbreviations for 
the nucleotides. 

The convenient isomorphism Nucdna ^ Nucrna is that given by RNA transcription; 
it sends A ^ A,C ^ C,G ^ G,T ^ U . (See also Application 4.1.2.16.) 

Protein production can be modeled as a function from the set of 3-nucleotide se- 
quences to the set of eukaryotic amino acids. However, it cannot be an isomorphism 
because there are 4^ = 64 triplets of RNA nucleotides, but only 21 eukaryotic amino 
acids. 

00 

Exercise 2.1.2.10. Let n g M be a natural number and let X be a set with exactly n 
elements. 

a. ) How many isomorphisms are there from X to itself? 

b. ) Does your formula from part a.) hold when n = 07 



Lemma 2.1.2.11. The following facts hold about isomorphism. 

1. Any set A is isomorphic to itself; i.e. there exists an isomorphism A ^ A. 
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2. For any sets A and B, if A is isomorphic to B then B is isomorphic to A. 

3. For any sets A, 5, and C , if A is isomorphic to B and B is isomorphic to C then 
A is isomorphic to C. 

Proof. 1. The identity function id^ : A ^ A \^ invertible; its inverse is id^ because 
id^ oid^ = id^. 

2. li f : A ^ B is invertible with inverse g: B ^ A then g is an isomorphism with 
inverse /. 

3. If f : A ^ B and f'.B C are each invertible with inverses g: B ^ A and 
g' : C B then the following calculations show that f o f is invertible with 
inverse g o g'\ 

W of)o{go g') = f o{fog)og' = f o ids o g' = f o g' = idc 

{g ^ g') ^ if ^ f) = 9 ^ {g' ^ f) ^ f = 9 ^ ^<^B o f = g o f = id^ 

□ 

Exercise 2.1.2.12. Let A and B be the sets drawn below: 



A:= 



a 


7 


• 


• 




Q 




• 




Note that the sets A and B are isomorphic. Supposing that f : B ^ {1,2,3,4,5} sends 
"Bob" to 1, sends Jft to 3, and sends rS to 4, is there a canonical function A {1, 2, 3, 4, 5} 
corresponding to /? ^ 

Exercise 2.1.2.13. Find a set A such that for any set X there is a isomorphism of sets 

X^Homset(A^). 



Hint: draw a picture of proposed A's and X's. 
For any natural number n G M, define a set 

n:= {1,2,3,. 

So, in particular, 2 = {1,2}, 1 = {1}, and = 0. 

Let A be any set. A function f:n^A can be written as a sequence 



(2.6) 



/=(/(l),/(2),...,/(n)). 

Exercise 2.1.2.14. a.) Let A = {a, 6, c, d}. If / : 10 ^ A is given by (a, 6, c, c, 6, a, a, 6), 
what is /(4)? 



^Canonical means something like "best choice", a choice that stands out as the only reasonable one. 
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b.) Let s: 7 ^ M be given by s{i) = . Write s out as a sequence. 



Definition 2.1.2.15. [Cardinality of finite sets] 

Let A be a set and n e M a natural number. We say that A is has cardinality n, 
denoted 

\A\=n, 

if there exists an isomorphism of sets A = n. If there exists some n g M such that A has 
cardinality n then we say that A is finite. Otherwise, we say that A is infinite and write 
\A\ ^ 00. 

Exercise 2.1.2.16. a.) Let A = {5,6,7}. What is \A\1 

b. ) What is |M|? 

c. ) What is |{nGN I n ^ 5}|? 



Lemma 2.1.2.17. Let A and B he finite sets. If there is an isomorphism of sets f: A ^ 
B then the two sets have the same cardinality, \A\ = \B\. 

Proof. Suppose / : A ^ 5 is an isomorphism. If there exists natural numbers m^n e 

M and isomorphisms a: m ^ A and b: n ^ B then m — — > A ^ 5 ^ n is an 
isomorphism. One can prove by induction that the sets m and n are isomorphic if and 
only if m = n. 

□ 



2.2 Commutative diagrams 

At this point it is difficult to precisely define diagrams or commutative diagrams in 
general, but we can give the heuristic idea. ^ Consider the following picture: 

A^^B (2.7) 

9 

c 

We say this is a diagram of sets if each of A, 5, C is a set and each of /, ^, is a function. 
We say this diagram commutes \i g o f = k. In this case we refer to it as a commutative 
triangle of sets. 

Application 2.2.1.1. The central dogma of molecular biology is that "DNA makes RNA 
makes protein". That is, there is a function from DNA triplets to RNA triplets and a 
function from RNA triplets to amino acids. But sometimes we just want to discuss the 
translation from DNA to amino acids, and this is the composite of the other two. The 
commutative diagram is a picture of this fact. 

00 




^We will define commutative diagrams precisely in Section 4.5.2. 
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Consider the following picture: 




We say this is a diagram of sets if each of A, 5, C, I) is a set and each of /, /i, z is a 
function. We say this diagram commutes \i g o f = % o h. In this case we refer to it as a 
commutative square of sets. 

Application 2.2.1.2. Given a physical system 5, there may be two mathematical ap- 
proaches f : S ^ A and g: S ^ B that can be applied to it. Either of those results in 
a prediction of the same sort, f : A ^ P and g' : B ^ P. For example, in mechanics 
we can use either Lagrangian approach or the Hamiltonian approach to predict future 
states. To say that the diagram 

S ^A 



B ^P 

commutes would say that these approaches give the same result. 

00 

And so on. Note that diagram (2.7) is considered to be the same diagram as each of 
the following: 




2.3 Ologs 

In this course we will ground the mathematical ideas in applications whenever possible. 
To that end we introduce ologs, which will serve as a bridge between mathematics and 
various conceptual landscapes. The following material is taken from [SK], an introduction 



2.3. OLOGS 
to ologs. 
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D 



an amino acid 
found in dairy 



N 



an amine group 




an electrically- 
charged side 
chain 



a side chain 



a carboxylic acid 



2.3.1 Types 

A type is an abstract concept, a distinction the author has made. We represent each 
type as a box containing a singular indefinite noun phrase. Each of the following four 
boxes is a type: 



a man 



an automobile 



(2.9) 



a pair (a, k;), where w is 
a woman and a is an au- 
tomobile 



a pair (a, w) where w is 
a woman and a is a blue 
automobile owned by w 



Each of the four boxes in (2.9) represents a type of thing, a whole class of things, 
and the label on that box is what one should call each example of that class. Thus "^a 
man^ does not represent a single man, but the set of men, each example of which is 
called "a man". Similarly, the bottom right box represents an abstract type of thing, 
which probably has more than a million examples, but the label on the box indicates the 
common name for each such example. 

Typographical problems emerge when writing a text-box in a line of text, e.g. the 
text-box I a man | seems out of place here, and the more in-line text-boxes there are, the 
worse it gets. To remedy this, I will denote types which occur in a line of text with 
corner- symbols; e.g. I will write "^a man^ instead of | a man | . 

2.3.1.1 Types with compound structures 

Many types have compound structures; i.e. they are composed of smaller units. Exam- 
ples include 



a man and 
a woman 



a food portion / and 
a child c such that c 
ate all of / 



a triple (p^a^j) where p is 
a paper, a is an author of 
p, and j is a journal in 
which p was published 



(2.10) 
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It is good practice to declare the variables in a "compound type", as I did in the last 
two cases of (2.10). In other words, it is preferable to replace the first box above with 
something like 



a man m and 
a woman w 



a pair (m, w) 
where m is a man 
and It; is a woman 



so that the variables (m, w) are clear. 

Rules of good practice 2.3.1.2. A type is presented as a text box. The text in that box 
should 

(i) begin with the word "a" or "an"; 

(ii) refer to a distinction made and recognizable by the olog's author; 

(iii) refer to a distinction for which instances can be documented; 

(iv) declare all variables in a compound structure. 

The first, second, and third rules ensure that the class of things represented by 
each box appears to the author as a well-defined set. The fourth rule encourages good 
"readability" of arrows, as will be discussed next in Section 2.3.2. 

I will not always follow the rules of good practice throughout this document. I think 
of these rules being followed "in the background" but that I have "nicknamed" various 
boxes. So "^Steve^ may stand as a nickname for "^a thing classified as Steve^ and Wginine^ 
as a nickname for "^a molecule of arginine\ However, when pressed, one should always 
be able to rename each type according to the rules of good practice. 



2.3.2 Aspects 

An aspect of a thing x is a way of viewing it, a particular way in which x can be regarded 
or measured. For example, a woman can be regarded as a person; hence "being a person" 
is an aspect of a woman. A molecule has a molecular mass (say in daltons), so "having 
a molecular mass" is an aspect of a molecule. In other words, by aspect we simply mean 
a function. The domain A of the function f : A ^ B is the thing we are measuring, and 
the codomain is the set of possible "answers" or results of the measurement. 

(2.11) 



a woman 



a person 



a molecule 



has as molecular mass (Da) 



a positive real number 



(2.12) 



So for the arrow in (2.11), the domain is the set of women (a set with perhaps 3 billion 
elements); the codomain is the set of persons (a set with perhaps 6 billion elements). 
We can imagine drawing an arrow from each dot in the "woman" set to a unique dot in 
the "person" set, just as in (2.2). No woman points to two different people, nor to zero 
people — each woman is exactly one person — so the rules for a function are satisfied. 
Let us now concentrate briefly on the arrow in (2.12). The domain is the set of molecules, 
the codomain is the set R>o of positive real numbers. We can imagine drawing an arrow 
from each dot in the "molecule" set to a single dot in the "positive real number" set. No 
molecule points to two different masses, nor can a molecule have no mass: each molecule 
has exactly one mass. Note however that two different molecules can point to the same 
mass. 
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2.3.2.1 Invalid aspects 

I tried above to clarify what it is that makes an aspect "valid", namely that it must be 
a "functional relationship." In this subsection I will show two arrows which on their face 
may appear to be aspects, but which on closer inspection are not functional (and hence 
are not valid as aspects). 

Consider the following two arrows: 



a person 


has 


a child 





a mechanical pencil 



a piece of lead 



(2.14*) 



A person may have no children or may have more than one child, so the first arrow is 
invalid: it is not a function. Similarly, if we drew an arrow from each mechanical pencil 
to each piece of lead it uses, it would not be a function. 

Warning 2.3.2.2. The author of an olog has a world- view, some fragment of which is 
captured in the olog. When person A examines the olog of person B, person A may or 
may not "agree with it." For example, person B may have the following olog 




which associates to each marriage a man and a woman. Person A may take the position 
that some marriages involve two men or two women, and thus see B's olog as "wrong." 
Such disputes are not "problems" with either A's olog or B's olog, they are discrepancies 
between world- views. Hence, throughout this paper, a reader R may see a displayed olog 
and notice a discrepancy between R's world-view and my own, but R should not worry 
that this is a problem. This is not to say that ologs need not follow rules, but instead 
that the rules are enforced to ensure that an olog is structurally sound, rather than that 
it "correctly reflects reality," whatever that may mean. 

Consider the aspect "^an object^ — has_^ weight\ At some point in history, this 
would have been considered a valid function. Now we know that the same object 
would have a different weight on the moon than it has on earth. Thus as world- 
views change, we often need to add more information to our olog. Even the validity 
of "^an object on earth^ — has_^ weight^ is questionable. However to build a model 
we need to choose a level of granularity and try to stay within it, or the whole model 
evaporates into the nothingness of truth! 

Remark 2.3.2.3. In keeping with Warning 2.3.2.2, the arrows (2.13*) and (2.14*) may 
not be wrong but simply reflect that the author has a strange world-view or a strange 
vocabulary. Maybe the author believes that every mechanical pencil uses exactly one 
piece of lead. If this is so, then "^a mechanical penciF ^^^^> "^a piece of lead^ is indeed a 
valid aspect! Similarly, suppose the author meant to say that each person was once a 
child, or that a person has an inner child. Since every person has one and only one inner 



child (according to the author), the map "^a person^ 



has as inner child 



■^a child^ is a valid 



aspect. We cannot fault the olog if the author has a view, but note that we have changed 
the name of the label to make his or her intention more explicit. 
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2.3.2.4 Reading aspects and paths as English phrases 



Each arrow (aspect) X ^ Y can be read by first reading the label on its source box 
(domain of definition) X, then the label on the arrow /, and finally the label on its 
target box (set of values) Y. For example, the arrow 



a book 



has as first author 



a person 



(2.15) 



is read "a book has as first author a person". 

Remark 2.3.2.5. Note that the map in (2.15) is a valid aspect, but that a similarly 



benign-looking map "^a book^ 



has as author 



■^a person^ would not be valid, because it is 



not functional. The authors of an olog must be vigilant about this type of mistake 
because it is easy to miss and it can corrupt the olog. 

Sometimes the label on an arrow can be shortened or dropped altogether if it is 
obvious from context. We will discuss this more in Section 2.3.3 but here is a common 
example from the way I write ologs. 



a pair (x, y) where 
X and y are integers 





an integer 



an integer 



(2.16) 



Neither arrow is readable by the protocol given above (e.g. "a pair (x, y) where x and 
y are integers x an integer" is not an English sentence), and yet it is obvious what each 
map means. For example, given (8, 11) in A, arrow x would yield 8 and arrow y would 
yield 11. The label x can be thought of as a nickname for the full name "yields, via the 
value of X," and similarly for y. I do not generally use the full name for fear that the 
olog would become cluttered with text. 

One can also read paths through an olog by inserting the word "which" after each 
intermediate box. ^ For example the following olog has two paths of length 3 (counting 
arrows in a chain): 



a child 


IS 

^ 


a person 





has as parents 




a pair (w, m) 
where is a 
woman and m 



a woman 



a date 


includes 


a year 





(2.17) 



The top path is read "a child is a person, who has as parents a pair {w^ m) where is a 
woman and m is a man, which yields, via the value of k;, a woman." The reader should 



"If the intended elements of an intermediate box are humans, it is pohte to use "who" rather than 
"which", and other such conventions may be upheld if one so desires. 
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read and understand the content of the bottom path, which associates to every child a 
year. 



2.3.2.6 Converting non- functional relationships to aspects 



There are many relationships that are not functional, and these cannot be considered 
aspects. Often the word "has" indicates a relationship — sometimes it is functional as 



has 



in "a person" > "^a stomach^, and sometimes it is not, as in "^a father^ "^a child\ 
Obviously, a father may have more than one child. This one is easily fixed by realizing 
that the arrow should go the other way: there is a function "^a child^ — ^ "^a father\ 



What about "^a person^ 



"^a car^. Again, a person may own no cars or more than 



one car, but this time a car can be owned by more than one person too. A quick fix 
would be to replace it by "^a person^ _owns^ cars\ This is ok, but the relationship 

between ""a car^ and "^a set of cars^ then becomes an issue to deal with later. There is 
another way to indicate such "non-functional" relationships. In this case it would look 
like this: 



a pair (p, c) where 
p is a person, c is a 
car, and p owns c. 





This setup will ensure that everything is properly organized. In general, relationships 
can involve more than two types, and the general situation looks like this 
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For example, 



a paper 



a sequence (p, a, j) where p 
is a paper, a is an author 
of p, and j is a journal in 
which p was published 



an author 



a journal 



Exercise 2.3.2.7. On page 23 we indicate a so-called invalid aspect, namely 



a person 


has 


a child 





(2.13*) 



Create a (valid) olog that captures the parent-child relationship; your olog should still 
have boxes "^a person^ and "^a child^ but may have an additional box. 

Rules of good practice 2.3.2.8. An aspect is presented as a labeled arrow, pointing from 
a source box to a target box. The arrow text should 



(i) begin with a verb; 



(ii) yield an English sentence, when the source-box text followed by the arrow text 
followed by the target-box text is read; and 

(iii) refer to a functional relationship: each instance of the source type should give rise 
to a specific instance of the target type. 



2.3.3 Facts 

In this section I will discuss facts, which are simply "path equivalences" in an olog. It is 
the notion of path equivalences that make category theory so powerful. 

A path in an olog is a head- to- tail sequence of arrows. That is, any path starts at 
some box Bq, then follows an arrow emanating from Bq (moving in the appropriate 
direction), at which point it lands at another box 5i, then follows any arrow emanating 
from 5i, etc, eventually landing at a box Bn and stopping there. The number of arrows 
is the length of the path. So a path of length 1 is just an arrow, and a path of length 
is just a box. We call the source and Bn the target of the path. 

Given an olog, the author may want to declare that two paths are equivalent. For 
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example consider the two paths from A to C in the olog 
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a person 



has as parents 



a pair {w^ m) 
where w \^ 
woman and 
m is a man 



has as mother 



yields as w 



C 



a woman 



(2.18) 



We know as Enghsh speakers that a woman parent is cahed a mother, so these two paths 
A ^ C should be equivalent. A more mathematical way to say this is that the triangle in 
Olog (2.18) commutes. That is, path equivalences are simply commutative diagrams as 
in Section 2.2. In the example above we concisely say "a woman parent is equivalent to 
a mother." We declare this by defining the diagonal map in (2.18) to be the composition 
of the horizontal map and the vertical map. 

I generally prefer to indicate a commutative diagram by drawing a check-mark, 
in the region bounded by the two paths, as in Olog (2.18). Sometimes, however, one 
cannot do this unambiguously on the 2-dimensional page. In such a case I will indicate 
the commutative diagrams (fact) by writing an equation. For example to say that the 
diagram 




commutes, we could either draw a checkmark inside the square or write the equation 
A f g ^ A h i above it. Either way, it means that "/ then ^" is equivalent to "/i then z". 
Here is another, more scientific example: 



a DNA sequence 



is transcribed to 



an RNA sequence 



codes for 



is translated to 



a protein 



Note how this diagram gives us the established terminology for the various ways in which 
DNA, RNA, and protein are related in this context. 

Exercise 2.3.3.1. Create an olog for human nuclear biological families that includes the 
concept of person, man, woman, parent, father, mother, and child. Make sure to label 
all the arrows, and make sure each arrow indicates a valid aspect in the sense of Section 
2.3.2.1. Indicate with check- marks (/) the diagrams that are intended to commute. If the 
2-dimensionality of the page prevents a check-mark from being unambiguous, indicate 
the intended commutativity with an equation. 
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Example 2.3.3.2 (Non-commuting diagram). In my conception of the world, the fohowing 
diagram does not commute: 




The non-commutativity of Diagram (2.19) does not imply that, in my conception, no 
person lives in the same city as his or her father. Rather it implies that, in my conception, 
it is not the case that every person lives in the same city as his or her father. 

Exercise 2.3.3.3. Create an olog about a scientific subject, preferably one you think 
about often. The olog should have at least five boxes, five arrows, and one commutative 
diagram. 



2.3.3.4 A formula for writing facts as English 

Every fact consists of two paths, say P and Q, that are to be declared equivalent. The 
paths P and Q will necessarily have the same source, say 5, and target, say t, but their 
lengths may be different, say m and n respectively. ^ We draw these paths as 

^ ao=s fi ai f2 a2 fs fm-i am-i fm am=t _ 

P : • ^ • ^ • ^ • • • ^ • ^ • (2.20) 

bo = s gi hi 92 62 93 9n-l bn-i 9n bn=t 

Q : • ^ • ^ • ^ • • • ^ • ^ • 

Every part i of an olog (i.e. every box and every arrow) has an associated English phrase, 
which we write as Using a dummy variable x we can convert a fact into English too. 
The following general formula is a bit difficult to understand, see Example 2.3.3.5, but 
here goes. The fact P ^ Q from (2.20) can be Englishified as follows: 



Given x, "5", consider the following. We know that x is "s", (2.21) 

which "/i" which '72" "^2", which ... "/m-i" "am-i", which "/m" 

that we'h can P{x). 

We also know that x is "5", 

which "61", which "^2" "^2", which ... "^'n-i" "^n-i", which "t" 
that we'll call Q{x). 

Fact: whenever x is "s", we will have P{x) = Q(x). 



^If the source equals the target, s ^ then it is possible to have m = or n = 0, and the ideas below 
still make sense. 
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A 

a person 



has 



an address 



lives in 



a city 



(2.22) 



To put the fact that Diagram 2.22 commutes into English, we first Englishify the two 
paths: F="a person has an address which is in a city" and G="a person lives in a city". 
The source of both is 5="a person" and the target of both is t="a city", write: 

Given x, a person, consider the following. We know that x is a person, 

which has an address, which is in a city 

that we'll can P{x). 

We also know that x is a person, 

which lives in a city 

that we'll call Q{x). 

Fact: whenever x is a person, we will have P{x) = Q{x). 
Exercise 2.3.3.6. This olog was taken from [Spl]. 



N 




a phone number 



has 



C 



an area code 



is assigned 



OLP 



corresponds to 



an operational land- 
line phone 



R 



a physical phone 



is currently 
located in 



a region 



(2.23) 



It says that a landline phone is physically located in the region that its phone number 
is assigned. Translate this fact into English using the formula from 2.21. 

Exercise 2.3.3.7. In the above olog (2.23), suppose that the box "^an operational landline 
phone^ is replaced with the box "^an operational mobile phone^ . Would the diagram still 
commute? 



2.3.3.8 Images 

In this section we discuss a specific kind of fact, generated by any aspect. Recall that 
every function has an image, meaning the subset of elements in the codomain that are 
"hit" by the function. For example the function f{x) = 2 * x: Z ^ Z has as image the 
set of all even numbers. 

Similarly the set of mothers arises as is the image of the "has as mother" function. 
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as shown below 



a person 




has as mother 




a person 



M=im(/) 



a mother 



Exercise 2.3.3.9. For each of the fohowing types, write down a function for which it is 
the image, or say "not clearly an image type" 

a. ) "^a book^ 

b. ) "^a material that has been fabricated by a process of type 

c. ) "^a bicycle owner^ 

d. ) "^a child^ 

e. ) "^a used book^ 

f. ) "^an inhabited residence^ 



2.4 Products and coproducts 

In this section we introduce two concepts that are likely to be familiar, although perhaps 
not by their category-theoretic names, product and coproduct. Each is an example of a 
large class of ideas that exist far beyond the realm of sets. 



2.4.1 Products 

Definition 2.4.1.1. Let X and Y be sets. The product of X and F, denoted X xY, is 
defined as the set of ordered pairs (x, y) where x e X and y e Y. Symbolically, 

XxY={{x,y)\xeX, yeY}. 

There are two natural projection functions tti : X x Y ^ X and 712'. X x Y ^ Y. 

X xY 




Example 2.4.1.2. [Grid of dots] 

Let X = {1, 2, 3, 4, 5, 6} and Y = {*, 0, ^, Then we can draw X x F as a 6-by-4 
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grid of dots, and the projections as projections 







X 


X Y 






Y 




(1,*) 

• 


(2,*) 
• 


(3,*) 
• 


(4,*) 
• 


(5,*) 
• 


(6,*) 
• 




• 




(1,0) 
• 


(2,0) 
• 


(3,0) 
• 


(4,0) 
• 


(5,0) 
• 


(6,0) 
• 





• 


(2.24) 


• 


(2,^) 
• 


(3,^) 
• 


(4,^) 
• 


• 


(6,^) 
• 




• 


• 


(2» 
• 


(3» 
• 


(4,*) 
• 


(5,-!fc) 

• 


• 




• 





1 2 3 4 5 6 

• ••••• 



X 



Application 2.4.1.3. A traditional (Mendelian) way to predict the genotype of offspring 
based on the genotype of its parents is by the use of Punnett squares. If F is the set of 
possible genotypes for the female parent and M is the set of possible genotypes of the 
male parent, then F x M is drawn as a square, called a Punnett square, in which every 
combination is drawn. 00 

Exercise 2.4.1.4. How many elements does the set {a, 6, c, (i} x {1,2,3} have? 

Application 2.4.1.5. Suppose we are conducting experiments about the mechanical prop- 
erties of materials, as in Application 2.1.2.1. For each material sample we will produce 
multiple data points in the set "^extension^ x Torce^ = R x R. 

00 

Remark 2.4.1.6. It is possible to take the product of more than two sets as well. For 
example, if A, 5, and C are sets then A x B x C is the set of triples, 

A X B X C := {(a, b,c) \ a e A,b e B,c e C}. 

This kind of generality is useful in understanding multiple dimensions, e.g. what 
physicists mean by 10-dimensional space. It comes under the heading of limits, which 
we will see in Section 4.5.3. 

Example 2.4.1.7. Let R be the set of real numbers. By R^ we mean R x R (though see 
Exercise 2.7.2.6). Similarly, for any n e N, we define R^ to be the product of n copies of 
R. 

According to [Pen], Aristotle seems to have conceived of space as something like 
S := M.^ and of time as something like T := R. Spacetime, had he conceived of it, 
would probably have been 6* x T ^ R^. He of course did not have access to this kind of 
abstraction, which was probably due to Descartes. 
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Exercise 2.4.1.8. Let Z denote the set of integers, and let + : Z x Z — > Z denote the 
addition function and • : Z x Z ^ Z denote the multiplication function. Which of the 
following diagrams commute? 



a.) 



X Z X Z — ■ — ^Z X 



(a,5,c)i-^(a+6,c) 



b.) 



c.) 



{x,y)^x+y 



{x,y)^xy 



x\-^{x,0) 




x\-^{x,l) 



{a,b)\-^a-b 




{a,b)^a-b 



2.4.1.9 Universal property for products 

Lemma 2.4.1.10 (Universal property for product). Let X and Y be sets. For any set 
A and functions f : A ^ X and g: A ^ Y , there exists a unique function A ^ X x Y 
such that the following diagram commutes ^ 



X xY 



(2.25) 



y Y 




We might write the unique function as 

f X g: A^ X xY. 

Proof. Suppose given f^g as above. To provide a function i: A ^ X xY is equivalent 
to providing an element i{a) g X x Y" for each a e A. We need such a function for which 



^The symbol V is read "for all"; the symbol 3 is read "there exists", and the symbol 3! is read "there 
exists a unique". So this diagram is intended to express the idea that for any functions f : A ^ X and 
g: A ^ Y, there exists a unique function A ^ X x Y for which the two triangles commute. 
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TTi o £ = / and 7T2 o i = g. An element of X x Y" is an ordered pair (x, y), and we can 
use £{a) = (x^y) if and only if x = 7ri{x,y) = f{a) and y = 7r2{x,y) = g{a). So it is 
necessary and sufficient to define 

(/x<7)(a) = (/(a),5(a)) 

for all a E A. 

□ 

Example 2.4.1.11. [Grid of dots, continued] 

We need to see the universal property of products as completely intuitive. Recall that 
if X and Y are sets, say of cardinalities \X\ = m and \Y\ = n respectively, then X x F is 
an m X n grid of dots, and it comes with two canonical projections X X xY Y . 
These allow us to extract from every grid element z e X xY lis column iti{z) g X and 
its row TT2{z) G Y . 

Suppose that each person in a classroom picks an element of X and an element of 
Y . Thus we have functions f'.C^X and g: C ^Y. But isn't picking a column and a 
row the same thing as picking an element in the grid? The two functions / and g induce 
a unique function C ^ X x Y . And how does this function C ^ X xY compare with 
the original functions / and gl The commutative diagram (2.25) sums up the obvious 
connection. 

Example 2.4.1.12. Let R be the set of real numbers. The origin in R is an element of R. 
As you showed in Exercise 2.1.2.13, we can view this (or any) element of R as a function 
z: {©} R, where {©} is any set with one element. Our function z "picks out the 
origin". Thus we can draw functions 

{©} 



R R 

The universal property for products guarantees a function {©} ^ R x R, which will be 
the origin in R^. 

Remark 2.4.1.13. Given sets X, F, and A, and functions f: A ^ X and g: A ^Y^ there 
is a unique function A ^ X x Y that commutes with / and g. We call it the induced 
function A ^ X xY ^ meaning the one that arises in light of / and g. 

Exercise 2.4.1.14. For every set A there is some nice relationship between the following 
three sets: 

Homset {A, X) , Homset {A,Y), and Homset [A, X xY). 
What is it? 

Hint: Do not be alarmed: this problem is a bit "recursive" in that you'll use products 
in your formula. 

Exercise 2.4.1.15. a.) Let X and Y be sets. Construct the "swap map" s: X xY ^ 
F X X using only the universal property for products. If tti : X x F ^ X and 
7r2 : X X F ^ F are the projection functions, write s in terms of the symbols tti, 7r2, x , 
and o. 
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b.) Can you prove that 5 is a isomorphism using only the universal property for product? 



Exercise 2.4.1.16. Suppose given sets A, A', B' and functions /: A ^ A' and g: B ^ 
B' . Use the universal property of products to construct a function s\ Ax B ^ A' x B' . 
li TTi: A X B ^ A and tt2'- A x B ^ B are the projection functions, write s in terms of 
the symbols f , 711,712, and x. 



2.4.1.17 Ologging products 

Given two objects c, d in an olog, there is a canonical label "c x (i" for their product cx 
written in terms of the labels "c" and "d". Namely, 



c X (i" := a pair (x, y) where x is "c" and y is "(i". 



The projections c <^ c x d ^ d can be labeled "yields, as x," and "yields, as respec- 
tively. 

Suppose that e is another object and p: e ^ c and q: e ^ d are two arrows. By 
the universal property of products (Lemma 2.4.1.10), p and q induce a unique arrow 
e ^ c X d making the evident diagrams commute. This arrow can be labeled 



yields, insofar as it "p" "c" and "g" "d". 



Example 2.4.1.18. Every car owner owns at least one car, but there is no obvious function 
■^a car owner^ "^a car^ because he or she may own more than one. One good choice 
would be the car that the person drives most often, which we'll call his or her primary 
car. Also, given a person and a car, an economist could ask how much utility the person 
would get out of the car. From all this we can put together the following olog involving 
products: 



a car owner 



yields, insofar 
as it is a person 
and owns, as 
-.primary, a car. 



PxC 



a pair (x, y) 
where x is a 
person and y is 
a car 




has as associ- 
ated utility 



a dollar value 



yields, as y, 



2.4.2 Coproducts 

Definition 2.4.2.1. Let X and Y be sets. The coproduct of X and F, denoted X uY, 
is defined as the "disjoint union" of X and F, i.e. the set for which an element is either 
an element of X or an element of Y. If something is an element of both X and Y then 
we include both copies, and distinguish between them, in X uY. See Example 2.4.2.2 
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There are two natural inclusion functions ii : X ^ X uY and 12'. Y ^ X uY. 



X 



Y 





X uY 



Example 2.4.2.2. The coproduct of X := {a, b, c, d} and Y := {1, 2, 3} is 



X uY ^ {a,&,c,(i,l,2,3}. 



The coproduct of X and itself is 



X u X ^ {iia, iic, iid, i2a, ^2^, i2C, ^2^^} 



The names of the elements in X uY are not so important. What's important are the 
inclusion maps ii,i2, which ensure that we know where each element oi X u Y came 
from. 



Example 2.4.2.3. [Airplane seats] 



X 



an economy- 
class seat in 
an airplane 



a first-class 
seat in an 
airplane 



(2.26) 



XuY 

a seat in an 
airplane 



Exercise 2.4.2.4. Would you say that "^a phone^ is the coproduct of "^a cellphone^ and "^a 
landline phone^? 



36 



CHAPTER 2. THE CATEGORY OF SETS 



Example 2.4.2.5 (Disjoint union of dots). 



X uY 



Y 





1 


2 


3 


4 


5 


6 






• 


• 


• 


• 


• 


• 


• 




• 






















• 
















• 




















• 
















• 




















• 
















• 



(2.27) 



X 



2.4.2.6 Universal property for coproducts 

Lemma 2.4.2.7 (Universal property for coproduct). Let X and Y be sets. For any set 
A and functions f : X ^ A and g: Y ^ A, there exists a unique function X uY ^ A 
such that the following diagram commutes 




XuY 

We might write the unique function as 

f ug: X uY ^ A. 

Proof Suppose given f^g as above. To provide a function i: X uY ^ A is equivalent 
to providing an element f{m) e ^4 is for each m e X uY . We need such a function such 
that £ o ii = f and £o 12 = g. But each element m e X uY is either of the form iix or 
i2y, and cannot be of both forms. So we assign 



{f ug){m) = 




ii m = iix, 
if m = i2y- 
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This assignment is necessary and sufficient to make all relevant diagrams commute. 

□ 

Example 2.4.2.8 (Airplane seats, continued). The universal property of coproducts says 
the following. Any time we have a function X ^ A and a function Y ^ A, we get a 
unique function X uY ^ A. For example, every economy class seat in an airplane and 
every first class seat in an airplane is actually in a particular airplane. Every economy 
class seat has a price, as does every first class seat. 



has as price 



an economy- 
class seat in 
an airplane 




(2.28) 



a first-class 
seat in an 
airplane 



The universal property of coproducts formalizes the following intuitively obvious fact: 

If we know how economy class seats are priced and we know how first class 
seats are priced, and if we know that every seat is either economy class or 
first class, then we automatically know how all seats are priced. 

To say it another way (and using the other induced map): 

If we keep track of which airplane every economy class seat is in and we 
keep track of which airplane every first class seat is in, and if we know that 
every seat is either economy class or first class, then we require no additional 
tracking for any airplane seat whatsoever. 

Application 2.4.2.9 (Piecewise defined curves). In science, curves are often defined or 
considered piecewise. For example in testing the mechanical properties of a material, 
we might be interested in various regions of deformation, such as elastic, plastic, or 
post-fracture. These are three intervals on which the material displays different kinds of 
properties. 

For real numbers a < 6 G R, let [a, 6] := {x g R | a ^ x ^ 6} denote the closed 
interval. Given a function [a, 6] ^ R and a function [c, d] R, the universal property 
of coproducts implies that they extend uniquely to a function [a, 6] u [c, d] R, which 
will appear as a piecewise defined curve. 

Often we are given a curve on [a, b] and another on [6, c], where the two curves agree 
at the point b. This situation is described by pushouts, which are mild generalizations 
of coproducts; see Section 2.6.2. 

00 

Exercise 2.4.2.10. Write the universal property for coproduct in terms of a relationship 
between the following three sets: 



Homset {X, A) , Homset {Y, A) , and Homset {X uY,A). 
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Example 2.4.2.11. In the following olog the types A and B are disjoint, so the coproduct 
C = A u B is just the union. 



A 



C=AuB 



a person 



^ a person or a cat 





B 


IS 


a cat 





Example 2.4.2.12. In the following olog, A and B are not disjoint, so care must be taken 
to differentiate common elements. 



C=AljB 



an animal 
that can fly 



labeled "A" is 



an animal that can fly 
(labeled "A") or an 
animal that can swim 
(labeled "B") 



labeled "B" is 



an animal that 
can swim 



Since ducks can both swim and fly, each duck is found twice in C, once labeled as a 
flyer and once labeled as a swimmer. The types A and B are kept disjoint in C, which 
justifies the name "disjoint union." 

Exercise 2.4.2.13. Understand Example 2.4.2.12 and see if a similar idea would make 
sense for particles and waves. Make an olog, and choose your wording in accordance with 
Rules 2.3.1.2. How do photons, which exhibit properties of both waves and particles, fit 
into the coproduct in your olog? 



Exercise 2.4.2.14. Following the section above, "Clogging products" page 34, come up 
with a naming system for coproducts, the inclusions, and the universal maps. Try it out 
by making an olog (involving coproducts) discussing the idea that both a .wav file and 
a .mp3 file can be played on a modern computer. Be careful that your arrows are valid 
in the sense of Section 2.3.2.1. 



2.5 Finite limits in Set 

In this section we discuss what are called limits of variously-shaped diagrams of sets. 
We will make all this much more precise when we discuss limits in arbitrary categories 
in Section 4.5.3. 

2.5.1 Fullbacks 

Definition 2.5.1.1 (Fullback). Suppose given the diagram of sets and functions below. 

Y (2.29) 

9 



Its fiber product is the set 



X xzY := {{x,w,y) \ f{x) = w = g{y)}. 
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There are obvious projections tti : X x zY ^ X and 7r2 : X xzY 
y). Note that if W = X xzY then the diagram 



Y (e.g. iT2{x,w,y) 



W 



Y 



Y 



(2.30) 



/ 



commutes. Given the setup of Diagram 2.29 we define the pullback of X and Y over Z 
to be any set W for which we have an isomorphism W ^ X xzY . The corner symbol 
^ in Diagram 2.30 indicates that W is the puhback. 



Exercise 2.5.1.2. Let X, F, Z be as drawn and /: X 
functions. 



X 



Z and g:Y ^ Z the indicated 




What is the puhback of the diagram X 



f 



Y7 



Exercise 2.5.1.3. a.) Draw a set X with five elements and a set Y with three elements. 
Color each element of X and each element of Y either red, blue, or yellow, ^ and do 
so in a "random-looking" way. Considering your coloring of X as a function X ^ C, 
where C = {red, blue, yellow}, and similarly obtaining a function F ^ C, draw the 
fiber product X xqY. Make sure it is colored appropriately. 

b.) The universal property for products guarantees a function X xcY ^ X x Y^ which 
I can tell you will be an injection. This means that the drawing you made of the 
fiber product can be imbedded into the 5x3 grid; please draw the grid and indicate 
this subset. 



Remark 2.5.1.4. Some may prefer to denote this fiber product hy f Xz g rather than 
X XzY. The former is mathematically better notation, but human-readability is often 
enhanced by the latter, which is also more common in the literature. We use whichever 
is more convenient. 

Exercise 2.5.1.5. a.) Suppose that Y = 0; what can you say about X xz Yl 

b.) Suppose now that Y is any set but that Z has exactly one element; what can you 
say about X xzYl 





You can use shadings rather than coloring, if coloring would be annoying. 
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Exercise 2.5.1.6. Let S = R^,T = R, and think of them as (Aristotihan) space and time, 
with the origin in S x T given by the center of mass of MIT at the time of its founding. 
Let Y = S xT and let : F ^ 5* be one projection and Y ^ T the other projection. 
Let X = {©} be a set with one element and let /i : X ^ 5 and /2 : X ^ T be given by 
the origin in both cases. 



a.) What are the fiber products Wi and 1^2- 



Wi 



X 



Y 



91 



fi 



W2 



Y 



92 



X 

/2 



b.) Interpret these sets in terms of the center of mass of MIT at the time of its founding. 





2.5.1.7 Using pullbacks to define new ideas from old 

In this section we will see that the fiber product of a diagram can serve to define a new 
concept. For example, in (2.33) we define what it means for a cellphone to have a bad 
battery, in terms of the length of time for which it remains charged. By being explicit, 
we reduce the chance of misunderstandings between different groups of people. This can 
be useful in situations like audits and those in which one is trying to reuse or understand 
data gathered by others. 

Example 2.5.1.8. Consider the following two ologs. The one on the right is the pullback 
of the one on the left. 



c 



a loyal 
customer 



a wealthy 
customer 



D 



a customer 



A=BxdC 



a customer 
that is wealthy 
and loyal 



c 



a loyal 
customer 



Y 
B 



a wealthy 
customer 



D 



a customer 



(2.31) 

Check from Definition 2.5.1.1 that the label, "a customer that is wealthy and loyal", is 
fair and straightforward as a label for the fiber product A = B X]j given the labels 
on 5, C, and D. 

Remark 2.5.1.9. Note that in Diagram (2.31) the top- left box could have been (non- 
canonically named) "^a good customer^ If it was taken to be the fiber product, then the 
author would be effectively defining a good customer to be one that is wealthy and loyal. 

Exercise 2.5.1.10. For each of the following, an author has proposed that the diagram 
on the right is a pullback. Do you think their labels are appropriate or misleading; that 
is, is the label on the upper-left box reasonable given the rest of the olog, or is it suspect 
in some way? 
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a.) 



a person 



has as favorite 
color 



blue 



a color 



a person whose 
favorite color is blue 



a person 



has as favorite 
color 



has as favorite 
color 



blue 



a color 



b.) 



a dog 



has as owner 



a woman 



D 



a person 



a dog whose owner 
is a woman 



has as owner 



has as owner 



a woman 



D 



a person 



c.) 



a space m 
our house 



a piece of 


furniture 




has 







a width 



a good fit 



a space m 
our house 



a piece of 
furniture 



a width 



Exercise 2.5.1.11. a.) Consider your olog from Exercise 2.3.3.1. Are any of the commu- 
tative squares there actuahy puhback squares? 

b.) Now use ologs with products and pullbacks to define what a brother is and what a 
sister is (again in a human biological nuclear family), in terms of types such as "^an 
offspring of mating pair (a, "^a person^, "^a male person^ "^a female person^ and 
so on. 





Definition 2.5.1.12 (Preimage). Let /: X ^ F be a function and y e Y slu element. 

The preimage of y under /, denoted f~^{y)^ is the subset f~^{y) := {x e X \ f{x) = y}. 

Exercise 2.5.1.13. Let / : X ^ y be a function and y g F an element. Draw a pullback 
diagram in which the fiber product is isomorphic to the preimage f~^{y). 
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Lemma 2.5.1.14 (Universal property for pullback). Suppose given the diagram of sets 
and functions as below. 

Y 

u 

X 

t 

For any set A and commutative solid arrow diagram as below ( i. e. functions f: A ^ X 
and g: A ^Y such that t o f = u o g), 

XxzY (2.32) 




X Y 




Z 



there exists a unique arrow fxzg'-A^XxzY making everything commute, 

f = 7rio{fxzg) and g = o {f x z g). 

Exercise 2.5.1.15. Create an olog whose underlying shape is a commutative square. Now 
add the fiber product so that the shape is the same as that of Diagram (2.32). Assign 

English labels to the projections 7ri,7r2 and to the dotted map A •^^^^> X F, such 
that these labels are as canonical as possible. 



2.5.1.16 Pasting diagrams for pullback 

Consider the diagram drawn below, which includes a left-hand square, a right-hand 
square, and a big rectangle. 

A' B' a 

13 k 
\f \f \f 

A 

f 3 

The right-hand square has a corner symbol indicating that B' = B xc C \^ d^. pullback. 
But the corner symbol on the left is ambiguous; it might be indicating that the left-hand 
square is a pullback, or it might be indicating that the big rectangle is a pullback. It 
turns out that \i B' ^ B xc C then it is not ambiguous because the left-hand square is 
a pullback if and only if the big rectangle is. 
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Proposition 2.5.1.17. Consider the diagram drawn below 

B' 

3 ' k 

A 

f s 

where B' = B xc C is a pullback. Then there is an isomorphism AxbB' = AxcC'. 
Said another way, 

Axb{BxcC')^AxcC'. 

Proof. We first provide a map (j): A xb {B xc C) A xc C . An element oi A xb 
{B Xc C) is of tiie form (a, 6, (6, c, c')) sucti ttiat /(a) = b^g{b) = c and k{c') = c. But 
tiiis implies that g o /(a) = c = k{c') so we put (j){a^b^ {b^c^c')) := {a^c^c') e A xc C . 
Now we provide a proposed inverse, AxcC'^Axb{B xcC). Given (a, c, d) with 
g o /(a) = c = k{d)^ let b = f{a) and note that (6, c, d) is an element of B xc C . So we 
can define ?/;(a, c, d) = (a, 6, (6, c, c')). It is easy to see that (j) and are inverse. 

□ 



Proposition 2.5.1.17 can be useful in authoring ologs. For example, the type "^a 
cellphone that has a bad battery^ is vague, but we can lay out precisely what it means 
using pullbacks: 



a cellphone that 
has a bad battery 



a bad battery 



less than 
1 hour 


^ 


between 
and 1 





(2.33) 



a cellphone 



a battery 



remains 
charged 
for 



a duration 
of time 



yields 



a range of 
numbers 



The category-theoretic fact described above says that since A = B x jj C and C = 
D xp \i follows that A = B xp E. That is, we can deduce the definition "a cellphone 
that has a bad battery is defined as a cellphone that has a battery which remains charged 
for less than one hour." 

Exercise 2.5.1.18. a.) Create an olog that defines two people to be "of approximately 
the same height" if and only if their height difference is less than half an inch, using 
a pullback. Your olog can include the box "^a real number x such that —.5 < x < .5\ 

b.) In the same olog, make a box for those people whose height is approximately the 
same as a person named "The Virgin Mary". You may need to use images, as in 
Section 2.3.3.8. 
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Exercise 2.5.1.19. Consider the diagram on the left below, where both squares commute. 

Y' W ^ Y' 



X 



Y 



Z' 




Let W = X ^zY and W = X' xz' Y' , and form the diagram to the right. Use the 
universal property of fiber products to construct a map W W such that all squares 
commute. 



2.5.2 Spans, experiments, and matrices 

Definition 2.5.2.1. Given sets A and 5, a s^an on A and 5 is a set R together with 
functions f : R ^ A and g: R ^ B. 




Application 2.5.2.2. Think of A and B as observables and as a set of experiments 
performed on these two variables. For example, let's say T is the set of possible tem- 
peratures of a gas in a fixed container and let's say P is the set of possible pressures of 
the gas. We perform 1000 experiments in which we change and record the temperature 

and we simultaneously also record the pressure; this is a span T <^ E P. The results 
might look like this: 



Experiment 


ID 


Temperature 


Pressure 


1 


100 


72 


2 


100 


73 


3 


100 


72 


4 


200 


140 


5 


200 


138 


6 


200 


141 









00 



Definition 2.5.2.3. Let A, B, and C be sets, and lei A ^ R ^ B s^nd B R' ^ B 
be spans. Their composite span is given by the fiber product R xb R' diS in the diagram 
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below: 




R 



R' 




A 



B 



C 



Application 2.5.2.4. Let's look back at our lab's experiment from Application 2.5.2.2, 

which resulted in a span T <^ E ^ P. Suppose we notice that something looks a little 
wrong. The pressure should be linear in the temperature but it doesn't appear to be. 
We hypothesize that the volume of the container is increasing under pressure. We look 
up this container online and see that experiments have been done to measure the volume 
as the interior pressure changes. The data has generously been made available online, 

which gives us a span P E' V . 

The composite of our lab's span with the online data span yields a span T ^ E" ^ V, 
where E^' := E xp E' . What information does this span give us? In explaining it, one 
might say "whenever an experiment in our lab yielded the same pressure as one they 
recorded, let's call that a data point. Every data point has an associated temperature 
(from our lab) and an associated volume (from their experiment). This is the best we 
can do." 

The information we get this way might be seen by some as unscientific, but it certainly 
is the kind of information people use in business and in every day life calculation — we get 
our data from multiple sources and put it together. Moreover, it is scientific in the sense 
that it is reproducible. The way we obtained our T-V data is completely transparent. 



We can relate spans to matrices of natural numbers, and see a natural "categorifica- 
tion" of matrix addition and matrix multiplication. If our spans come from experiments 
as in Applications 2.5.2.2 and 2.5.2.4 the matrices involved will look like huge but sparse 
matrices. Let's go through that. 

Let A and B be sets and \ei A <^ R ^ B he span. By the universal property of 
products, we have a unique map R ^ A x B. 

We make a matrix of natural numbers out of this data as follows. The set of rows 
is A, the set of columns is B. For elements a e A and b e B, the (a, 6)-entry is the 
cardinality of its preimage, |p~^(a, 6)|, i.e. the number of elements in R that are sent by 
p to (a, b). 

Given two spans of the same dimensions, i.e. A ^ R ^ B and A ^ R' ^ B we can 
take the disjoint union Ria R' and by the universal property of coproducts we have a 
unique span A ^ Ria R' ^ B making the requisite diagram commute. The matrix 



00 



10 



R 




A 



RuR' 



B 




R' 
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corresponding to this new span will be the sum of the matrices corresponding to the two 
previous spans out of which it was made. 

Given a span A <^ R ^ B and a span B ^ S ^ C ^ the composite span can be formed 
as in Definition 2.5.2.3. It will correspond to the usual multiplication of matrices. 

Construction 2.5.2.5. Given a span A one can draw a bipartite graph with 

each element of A drawn as a dot on the left, each element of B drawn as a dot on the 
right, and each element r e R drawn as an arrow connecting vertex /(r) on the left to 
vertex g{r) on the right. 

Exercise 2.5.2.6. a.) Draw the bipartite graph (as in Construction 2.5.2.5) corresponding 
to the span T ^ E P in Application 2.5.2.2. 

b. ) Now make up your own span P ^ E' -^V and draw it. Finally, draw the composite 

span below. 

c. ) Can you say how the composite span graph relates to the graphs of its factors? 





2.5.3 Equalizers and terminal objects 
Definition 2.5.3.1. Suppose given two parallel arrows 

X^^Y. Eq{f,g)^^X^^Y (2.34) 

9 9 

The equalizer of f and g is the commutative diagram as to the right in (2.34), where we 
define 

Eqif,g):={xeX\f{x)=gix)} 
and where p is the canonical inclusion. 

Example 2.5.3.2. Suppose one has designed an experiment to test a theoretical prediction. 
The question becomes, "when does the theory match the experiment?" The answer is 
given by the equalizer of the following diagram: 

should, according to theory, yield 



an mput an output 

according to experiment yields 

The equalizer is the set of all inputs for which the theory and the experiment yield the 
same output. 

Exercise 2.5.3.3. Come up with an olog that uses equalizers in a reasonably interest- 
ing way. Alternatively, use an equalizer to specify those published authors who have 
published exactly one paper. Hint: find a function from authors to papers; then find 
another. 
Exercise 2.5.3.4. Find a universal property enjoyed by the equalizer of two arrows, and 
present it in the style of Lemmas 2.4.1.10, 2.4.2.7, and 2.5.1.14. 
Exercise 2.5.3.5. a.) A terminal set is a set S such that for every set X, there exists a 
unique function X ^ S. Find a terminal set. 

b.) Do you think that the notion terminal set belongs in this section (Section 2.5)? How 
so? If products, pullbacks, and equalizers are all limits, what do limits have in 
common? 
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2.6 Finite colimits in Set 

This section will parallel Section 2.5 — I will introduce several types of finite colimits and 
hope that this gives the reader some intuition about them, without formally defining 
them yet. Before doing so, I must define equivalence relations and quotients. 

2.6.1 Background: equivalence relations 

Definition 2.6.1.1 (Equivalence relations and equivalence classes). Let X be a set. An 
equivalence relation on X is a subset R ^ X x X satisfying the following properties for 
all x^y^ z E X: 

Reflexivity: (x, x) e R; 

Symmetry: (x, y) e R \i and only if x) e R; and 
Transitivity: if (x, y) e R and z) e R then (x, z) e R. 

If R is an equivalence relation, we often write x or simply x ~ to mean (x, y) e R. 

For convenience we may refer to the equivalence relation by the symbol ~, saying that 
~ is an equivalence relation on X. 

An equivalence class o/ ~ is a subset A <^ X such that 

• A is nonempty, A 0; 

• if X E A and x' e A, then x ~ x'; and 

• if X E A and x ^ y^ then y e A. 

Suppose that ~ is an equivalence relation on X. The quotient of X by denoted X/ ~ 
is the set of equivalence classes of ^. 

Example 2.6.1.2. Let Z denote the set of integers. Define a relation ^ Z x Z by 

R = {(x, ?/) I 3n G Z such that x -\- In = y). 

Then R is an equivalence relation because x + 7 * = x (refiexivity) ] x + 7 = y if and 
only if + 7 * (— n) = x (symmetry); and x -\- 7n = y and y + 7m = z together imply 
that X + 7(m -\- n) = z (transitivity). 

Exercise 2.6.1.3. Let X be the set of people on earth; define a binary relation R ^ X x X 
on X as follows. For a pair (x, y) of people, say (x^y) e Rifx spends a lot of time thinking 
about y. 

a. ) Is this relation refiexive? 

b. ) Is it symmetric? 

c. ) Is it transitive? 



Example 2.6.1.4. [Partitions] 

An equivalence relation on a set X can be thought of as a way of partitioning X. A 
partition of X consists of a set /, called the set of partitions^ and for every element i e I 
a subset Xi <^ X such that two properties hold: 
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• every element x G X is in some subset (i.e. for all x g X there exists i e I such 
that X e Xi); and 

• no element can be found in two different subsets (i.e. if x e Xi and x g Xj then 
i = j)- 

Given a partition of X, we define an equivalence relation on X by saying x ^ x' 
if X and x' are in the same partition (i.e. if there exists i e I such that x^x' g Xi). 
The partitions become the equivalence classes of this relation. Conversley, given an 
equivalence relation, one makes a partition on X by taking / to be the set of equivalence 
classes and for each i e I letting Xi be the elements in that equivalence class. 

Exercise 2.6.1.5. Let X and B be sets and let / : X ^ 5 be a function. Define a subset 
Q X X X by 

R = {{x,y)\f{x)=f{y)}. 

Is R an equivalence relation? 

a. ) Are all equivalence relations on X obtainable in this way (as the fibers of some 

function having domain X)? 

b. ) Does this viewpoint on equivalence classes relate to that of Example 2.6.1.4? 



Exercise 2.6.1.6. Take a set / of sets; i.e. suppose that for each element i e I you are 
given a set X^. For every two elements i^j^I say that i ^ j if Xi and Xj are isomorphic. 
Is this relation an equivalence relation on 17 

Definition 2.6.1.7 (Generating equivalence relations). Let X be a set and R X x X 
a subset. Define S to be the smallest subset of X x X such that S is an equivalence 
relation and R'^ S. We say that S is the equivalence relation generated by R. 

Exercise 2.6.1.8. Consider the set R of real numbers. Draw the coordinate plane R x R, 
give it coordinates x and y. A binary relation on R is a subset 6* ^ R x R, which can be 
drawn as a set of points in the plane. 

a. ) Draw the relation {{x^y) \ y = x'^}. 

b. ) Draw the relation {{x^y) \ y ^ x^}. 

c. ) Let So be the equivalence relation on R generated (in the sense of Definition 2.6.1.7) 

by the empty set. Draw 5* as a subset of the plane. 

d. ) Consider the equivalence relation Si generated by {(1, 2), (1, 3)}. Draw Si in the 

plane. Highlight the equivalence class containing (1,2). 

e. ) The refiexivity property and the symmetry property have pleasing visualizations in 

R X R; what are they? 

f. ) Is there a nice heuristic for visualizing the transitivity property? 



Exercise 2.6.1.9. Consider the binary relation R = {{n^n -\- 1) | n g Z} ^ Z x Z. 

a. ) What is the equivalence relation generated by R7 

b. ) How many equivalence classes are there? 
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Exercise 2.6.1.10. Suppose A/" is a network (or graph). Let X be the nodes of the 
network, and let R ^ X x X denote the relation such that {x,y) e R iS there exists an 
edge connecting x and y. 

a. ) What is the equivalence relation ~ generated by Rl 

b. ) What is the quotient X/ ^7 



2.6.2 Pushouts 

Definition 2.6.2.1 (Pushout). Suppose given the diagram of sets and functions below: 

W^^X (2.35) 

9 

Y 

Its fiber sum^ denoted Xu^^y, is defined as the quotient of Xu W^u Y" by the equivalence 
relation ~ generated hy w ^ f{w) and w ^ g{w) for all w e W . 

X LAW Y \= {X uW u Y)/ where Vk; e W, w ^ f{w) and w g{w). 

There are obvious inclusions ii: X ^ X and 12'. Y ^ X uw y • Note that if 

Z = X uw Y then the diagram 

W^-^Y (2.36) 

/ i2 

X — ^z 

commutes. Given the setup of Diagram 2.35 we define the pushout of X and Y over W 
to be any set Z for which we have an isomorphism Z ^ X • The corner symbol 

in Diagram 2.36 indicates that Z is the pushout. 

Example 2.6.2.2. Let X = {x g R | ^ x ^ 1} be the set of numbers between and 1, 
inclusive, let F = {?/ g R | 1 ^ ^ 2} by the set of numbers between 1 and 2, inclusive, 

and let = {1}. Then the pushout X ^Y, where / and g are the "obvious" 

functions (1 ^ 1) is X uw 1^ = g R | ^ z ^ 2}, as expected. When we eventually 
get to general colimits, one can check that the whole real line can be made by patching 
together intervals in this way. 

Example 2.6.2.3. In each example below, the diagram to the right is intended to be a 
pushout of the diagram to the left. The new object, is the union of B and C, but 
instances of A are equated to their B and C aspects. This will be discussed after the 
two diagrams. 

■'^■'^Note that our term inclusions is not too good, because it seems to suggest that ii and 12 are injective 
(see Definition 2.7.5.1) and this is not always the case. 
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a cell 


in the 


shoulder 


is 




1 


3 


a cell 


in the 


torso 





c 

a cell in 
the arm 



a cell 


in the 


shoulder 


is 








B 


a cell 


in the 


torso 





is 

^ 


a cell in 




the arm 



D = BljaC 



a cell in the 
torso or arm 



In the left-hand olog (2.37, the two arrows are inclusions: the author considers every cell 
in the shoulder to be both in the arm and in the torso. The pushout is then just the 
union, where cells in the shoulder are not double-counted. 



a college 
mathematics 



course 



yields 



C 

an utterance 
of the phrase 
"too hard" 



a college 
course 



a college 
mathematics 



course 



yields 



a college 
course 



an utterance 
of the phrase 
"too hard" 



a college course, 
where every 
mathematics 
course is 
replaced by an 
utterance of the 
phrase "too 
hard" 



(2.38) 



In Olog (2.37), the shoulder is seen as part of the arm and part of the torso. When 
taking the union of these two parts, we do not want to "double-count" the shoulder (as 
would be done in the coproduct B u see Example 2.4.2.12). Thus we create a new 
type A for cells in the shoulder, which are considered the same whether viewed as cells in 
the arm or cells in the torso. In general, if one wishes to take two things and glue them 
together, with A as the glue and with B and C as the two things to be glued, the union 
is the pushout B ua C. (A nice image of this can be seen in the setting of topological 
spaces, see Example 4.5.3.29.) 

In Olog (2.38), if every mathematics course is simply "too hard," then when reading 
off a list of courses, each math course will not be read aloud but simply read as "too 
hard." To form D we begin by taking the union of B and C, and then we consider 
everything in A to be the same whether one looks at it as a course or as the phrase "too 
hard." The math courses are all blurred together as one thing. Thus we see that the 
power to equate different things can be exercised with pushouts. 

Exercise 2.6.2 A. Let W, X, Y be as drawn and f:W^X and g-.W^Y the indicated 
functions. 
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X 



w 




The pushout of the diagram X 
of P = n as a natural number n e 



f 



W > F is a set P. Write down the cardinality 





Exercise 2.6.2.5. Suppose that W = 0] what can you say about X uw Z7 

Exercise 2.6.2.6. Let := M = {0, 1, 2, . . .} denote the set of natural numbers, let X = Z 
denote the set of integers, and let Y = {©} denote a one-element set. Define f : W ^ X 
by f{w) = —{w + 1), and define ^: ^ F to be the unique map. Describe the set 
XiAwY- 

Exercise 2.6.2.7. Let z: ^ X x X be an equivalence relation (see Example 2.1.2.3 
for notation). Composing with the projections 7ri,7r2: X x X ^ X, we have two maps 
7Ti o : R ^ X and 712 o i: R ^ X. 

a.) What is the pushout 



b.) If z : ^ X X X is not assumed to be an equivalence relation, we can still define the 
pushout above. Is there a relationship between the pushout X R X and 
the equivalence relation generated by ^ X x X? 



Lemma 2.6.2.8 (Universal property for pushout). Suppose given the diagram of sets 
and functions as below. 

W—^Y 



X 



For any set A and commutative solid arrow diagram as below ( i. e. functions f: X ^ A 
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XuwY 

there exists a unique arrow f uw 9' X uw Y ^ A making everything commute, 
f = {f^wg)oh and g = {fuwg)oi2. 

2.6.3 Other finite colimits 

Definition 2.6.3.1. [Coequalizer] 
Suppose given two parallel arrows 

X Y. X Y Coeqi f, g) (2.40) 

9 9 

The coequalizer of f and g is the commutative diagram as to the right in (2.40), where 
we define 

Coeq{f,g) := Y / f(x) ^ g(x) 

i.e. the coequalizer of / and g is the quotient of Y by the equivalence relation generated 
by {ifix),g{x))\xeX}<^Y xY 

Exercise 2.6.3.2. Let X = R be the set of real numbers. What is the coequalizer of the 
two maps X ^ X given hy x ^ x and x ^ (x + 1) respectively? 

Exercise 2.6.3.3. Find a universal property enjoyed by the coequalizer of two arrows. 

Exercise 2.6.3.4 (Initial object). An initial set is a set 5* such that for every set A, there 
exists a unique function S ^ A. 

a. ) Find an initial set. 

b. ) Do you think that the notion initial set belongs in this section (Section 2.6)? How 

so? If coproducts, pushouts, and coequalizers are all colimits, what do colimits have 
in common? 





2.7 Other notions in Set 

In this section we discuss some left-over notions in the category of Sets. 
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2.7.1 Retractions 

Definition 2.7.1.1. Suppose we have a function f: X ^ Y and a function g: Y ^ X 
such that g o f = idx- In this case we caU / a retract section and we caU g a retract 
projection. 

Exercise 2.7.1.2. Create an olog that includes sets X and F, and functions f : X ^ Y 
and g: Y ^ X such that go f = idx but such that f o g ^ idy ; that is, such that / is a 
retract section but not an isomorphism. 

2.7.2 Currying 

Currying is the idea that when a function takes many inputs, we can input them one at 
a time or all at once. For example, consider the function that takes a material M and 
an extension E and returns the force transmitted through the material when it is pulled 
to that extension. This is a function e: "^a material^ x W extension^ "^a force^. This 
function takes two inputs at once, but it is convenient to "curry" the second input. Recall 
that HomsetC^an extension^ "^a force^) is the set of theoretical force-extension curves. 
Currying transforms e into a function 

e' : "^a material^ HomsetC^an extension^ "^a force^). 

This is a more convenient way to package the same information. 

In fact, it may be convenient to repackage this information another way. For any 
extension, we may want the function that takes a material and returns how much force 
it can transmit at that extension. This is a function 

e'' : "^an extension^ Homset ( "^a material^ , "^a force^ ) . 

Notation 2.7.2.1. Let A and B be sets. We sometimes denote the set of functions from 
A to 5 by 

:=Romset{A,B). (2.41) 

Exercise 2.7.2.2. For a finite set A, let 1^41 g M denote the cardinality of (number of 
elements in) A. If A and B are both finite (including the possibility that one or both 
are empty), is it always true that l^^^l = l^l'"^!? 

Proposition 2.7.2.3 (Currying). Let A denote a set. For any sets X^Y there is a 
bijection 

(t>: Homset(^ x A,Y) ^ Homset(^, Y^). (2.42) 

Proof. Suppose given f: X x A ^ Y. Define (/>(/) : X Y^ as follows: for any x e X 
let (j){f){x) : A^Y he defined as follows: for any a e A, let (j){f){x){a) := /(x, a). 

We now construct the inverse, i/j: Homset (-^, ^^) IIomset(-^ x A^Y). Suppose 
given g: X ^ Y^. Define il^{g) : X x A ^ Y" as follows: for any pair (x, a) e X x A) let 
il;{g){x,a) := g{x){a). 

Then for any / G Homset(-^ x A^Y) we have ijj o (j)[f){x.,a) = (j){f){x){a) = f{x,a), 
and for any g e Homset(-^5 ^^) we have (f) o 2p{g){x){a) = 2p{g){x,a) = g{x){a), Thus we 
see that is an isomorphism as desired. 

□ 
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Exercise 2.7.2.4. Let X = {1,2}, A = {a, 6}, and Y = {x,y}. 

a. ) Write down three distinct elements of L := Homset(-^ x ^7^)- 

b. ) Write down all the elements of M := Homset(^,^)- 

c. ) For each element i e write down the corresponding function : X ^ M guar- 

anteed by Proposition 2.7.2.3. 



Exercise 2.7.2.5. Let A and B be sets. We know that Homset(^, ^) = so we 
have a function id^A : Homset(^, ^) B^. Look at Proposition 2.7.2.3, making the 
substitutions X = HomsetC^,^), Y = and A = A. Consider the function 

Homset(Homset(A,5),5^) ^ Homset(Homset(A, 5) x A, B) 

obtained as the inverse of (2.42). We have a canonical element id^A in the domain 
of We can apply the function and obtain an element ev = 0~^(id^A) g 

Homset(Homset(^? ^) x A, 5), which is itself a function, 

ev: Homset(^, ^) x A^ B. 

a. ) Describe the function ev in terms of how it operates on elements in its domain. 

b. ) Why might one be tempted to denote this function by ev7 



If n G M is a natural number, recall from (2.6) that there is a nice set n = {1,2,..., n}. 
If A is a set, we often make the abbreviation 

:= A^. (2.43) 

Exercise 2.7.2.6. In Example 2.4.1.7 we said that is an abbreviation for R x R, but 
in (2.43) we say that R^ is an abbreviation for R-. Use Exercise 2.1.2.13, Proposition 
2.7.2.3, Exercise 2.4.2.10, and the fact that 1+1=2, to prove that these are isomorphic, 
R2 ^ (R X R). 

(The answer to Exercise 2.1.2.13 was A = {©}: i.e. Homset({©}, ^) = X for ah X.) 



2.7.3 Arithmetic of sets 

Proposition 2.7.3.1 summarizes the properties of products, coproducts, and exponentials, 
and shows them all in a familiar light, namely that of arithmetic. In fact, one can think 
of the natural numbers as literally being the isomorphism classes of finite sets — that's 
what they are used for in counting. Consider the standard procedure for counting the 
elements of a set say cows in a field: one points to an element in S and simultaneously 
says "1", points to another element in 5* and simultaneously says "2", and so on until 
finished. This procedure amounts to nothing more than creating an isomorphism (one- 
to-one mapping) between 5* and some set n. 

Again, the natural numbers are the isomorphism classes of finite sets. Their behavior, 
i.e. the arithmetic of natural numbers, refiects the behavior of sets. For example the fact 
that multiplication distributes over addition is a fact about grids of dots as in Example 
2.4.1.2. The following proposition lays out such arithmetic properties of sets. 
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In this proposition, we denote the coproduct of two sets A and B by the notation 
A -\- B rather than A u B. It is a reasonable notation in general, and one that is often 
used. 

Proposition 2.7.3.1. The following isomorphisms exist for any sets A, 5, and C (except 
for one caveat, see Exercise 2.7.3.2). 

A + ^^A 

A+B^B+A 

{A + B) + C ^A + {B + C) 

A X 0^0 

Axl^A 

Ax B ^ B X A 

{Ax B) X C ^ Ax {B X C) 

Ax{B + C)^{AxB) + {AxC) 

A^^l 

A^^A 

1^ = 1 

{A^f ^ A^x^ 

Exercise 2.7.3.2. Everything in Proposition 2.7.3.1 is true except in one case, namely 
that of 

In this case, we get conflicting answers, because for any set A, including A = = 0, we 
have claimed both that A- = 1 and that 0^ = 0. 

What is the correct answer for 0-, based on the definitions of and 1, given in (2.6), 
and of A^, given in (2.41)? 

Exercise 2.7.3.3. It is also true of natural numbers that if a, 6 g N and ab = then either 
a = or 6 = 0. Is the analogous statement true of all sets? 

2.7.4 Subobjects and characteristic functions 

Definition 2.7.4.1. For any set 5, define the power set of B, denoted P(5), to be the 
set of subsets of B. 

Exercise 2.7.4.2. a.) How many elements does P(0) have? 

b. ) How many elements does P({©}) have? 

c. ) How many elements does P({1, 2, 3, 4, 5, 6} have? 

d. ) Any idea why they may have named it "power set"? 
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2.7.4.3 Simplicial complexes 

Definition 2.7.4.4. Let ^ be a set and let P(V) be its powerset. A subset X ^ P(V) 
is called downward- dosed if, for every u e X and every u' ^ we have u' e X. We say 
that X contains all atoms if for every v eV the singleton set {v} is an element of X. 

A simplicial complex is a pair (V, X) where V is a set and X ^ P(V) is a downward- 
closed subset that contains all atoms. The elements of X are called simplices (singular: 
simplex). Any subset u<^V has a cardinality so we have a function X ^ M sending 
each simplex to its cardinality. The set of simplices with cardinality n + 1 is denoted X^ 
and each element x e Xn is called an n-simplex. Since X contains all atoms (subsets 
of cardinality 1), we have Xq = V ^ and we may also call the 0-simplices vertices. We 
sometimes call the 1-simplices edges. ^'^ 

Since Xq = V ^ we may denote a simplicial complex (V, X) simply by X. 

Example 2.7.4.5. Let n g M be a natural number and let V = n + 1 . Define the n-simplex, 
denoted A"^, to be the simplicial complex ¥{V) ^ P(^), i.e. the whole power set, which 
indeed is downward-closed and contains all atoms. 

We can draw a simplicial complex X by first putting all the vertices on the page as 
dots. Then for every x G Xi, we see that x = {v,v'} consists of 2 vertices, so we draw 
an edge connecting v and v' . For every y g X2 we see that y = {w,w\w''} consists of 3 
vertices, so we draw a (filled-in) triangle connecting them. All three edges will be drawn 
too because X is assumed to be downward closed. 

Thus, the 0-simplex A^, the 1-simplex A^, the 2-simplex A^, and the 3-simplex A^ 
are drawn here: 



2 2 




The n-simplices for various n's are in no way all of the simplicial complexes. In 
general a simplicial complex is a union or "gluing together" of simplices in a prescribed 
manner. For example, consider the simplicial complex X with vertices Xq = {1,2,3,4}, 
edges Xi = {{1, 2}, {2, 3}, {2, 4}}, and no higher simplices X2 = X3 = • • • = 0. We 
might draw X as follows: 

12 3 
• • • 



4 

Exercise 2.7.4.6. Let X be the following simplicial complex, so that Xq = {A, 5, . . . , M}. 



^"^It is annoying at first that the set of subsets with cardinahty 1 is denoted Xq, etc. But this is 
standard convention because as we will see, Xn will be n-dimensional. 

-•^^The reason we wrote Xq = V rather than Xq ^ V is that Xq is the set of 1-element subsets of V. 
So if y = {a, b, c} then Xq = {{a}, {b}, {c}}. This is really just pedantry. 
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E 



B 



L 



M 



In this case Xi consists of elements like {A, B} and {D^ K} but not {D^ J}. 

Write out X2 and X3 (hint: the drawing of X indicates that X3 should have one 



Exercise 2.7.4.7. The 2-simplex is drawn as a filled-in triangle with vertices V = 
{1, 2, 3}. There is a simplicial complex X = d/S?' that would be drawn as an empty trian- 
gle with the same set of vertices. What is XI In other words what are Xq, Xi , X2, X3, . . .? 



2.7.4.8 Subobject classifier 

Definition 2.7.4.9. Define the subobject classifier for Set, denoted Q to be the set 

Q := {True^False}^ together with the function {©} Vt sending the unique element to 
True. 

Proposition 2.7.4.10. Let B be a set. There is an isomorphism 



Proof. Given a function f : B ^ Vt, \ei (/){/) = {b e B \ f{b) = True} ^ B. We now 
construct a function tjj: ¥{B) Homset(^,^) to serve as the inverse of (/). Given a 
subset i: B' <^ B^ define : 5 ^ as follows: 



element. 











(/>: Homset(5,J^) ^P(5). 




One checks easily that (j) and ip are mutually inverse. 



□ 



Definition 2.7.4.11 (Characteristic function). Given a subset B' ^ 5, we call the 
corresponding function B ^ Vt the characteristic function of B' in B. 
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Let B be any set and let ¥{B) be its power set. By Proposition 2.7.4.10 there is a 
bijection between ¥{B) and . Since Q has cardinahty 2, the cardinahty of F{B) is 
2l^l, which explains the correct answer to Exercise 2.7.4.2. 

Exercise 2.7.4.12. Let f : A ^ Q denote the characteristic function of some A' ^ A, and 
define A^' Q A to be its complement, A^' := A — A^ (i.e. a e A!' if and only if a ^ A!). 

a. ) What is the characteristic function of A!' ^ Al 

b. ) Can you phrase it in terms of some function ^ ^ W. 



2.7.5 Surjections, injections 

The classical definition of injections and surjections involves elements, which we give now. 
But a more robust notion involves all maps and will be given in Proposition 2.7.5.4. 

Definition 2.7.5.1. Let / : X ^ Y" be a function. We say that / is surjective if, for all 
y e Y there exists some x e X such that f{x) = y. We say that / is injective if, for all 
X e X and all x^ e X with f{x) = f{x') we have x = x' . 

A function that is both injective and surjective is called bijective. 

Remark 2.7.5.2. It turns out that a function that is bijective is always an isomorphism 
and that all isomorphisms are bijective. We will not show that here, but it is not too 
hard; see for example [Big, Theorem 5.4]. 

Definition 2.7.5.3 (Monomorphisms, epimorphisms). Let / : X ^ F be a function. 
We say that / is a monomorphism if for all sets A and pairs of functions h^h^ : A ^ X, 



h 




h' 



\i f oh = f oh' then h = h' . 

We say that / is an epimorphism if for all sets B and pairs of functions g^g' \Y ^ B^ 



9 




9' 



if ^ o / = o / then g = g' . 

Proposition 2.7.5.4. Let f : X ^ Y be a function. Then f is injective if and only if 
it is a monomorphism; f is surjective if and only if it is an epimorphism. 

Proof. If / is a monomorphism it is clearly injective by putting A = {©}. Suppose that 
/ is injective and let h^h' : A ^ X he functions such that f o h = f o h\ but suppose 
for contradiction that h 7^ h' . Then there is some element a e A such h{a) ^ h{a') e X. 
But by injectivity f{h{a)) 7^ f{h{a'))^ contradicting f oh = f oh' . 

Suppose that / : X ^ F is an epimorphism and choose some y^ ^Y (noting that if 
Y is empty then the claim is vacuously true). Let g\Y^^ denote the characteristic 
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function of the subset {yo} ^ Y and let g' : Y ^ Q denote the characteristic function of 
Y ^Y; note that g{y) = g'{y) for ah y 7^ yo. Then since g ^ g' we have go f ^ g' o f^so 
there exists x e X with g{f{x)) ^ g'{f{x))^ which imphes that f{x) = yo- This proves 
that / is surjective. 

Finahy, suppose that / is surjective, and let g^g' : Y ^ B he functions with g o f = 
g' o /. For any y e Y ^ there exists some x e X with f{x) = y^ so g{y) = g{f{x)) = 
g'{f{x)) = g\y). This proves that / is an epimorphism. 



Proposition 2.7.5.5. Let f : X ^ Y be a monomorphism. Then for any function 
g: A ^Y , the top map f: X xy A ^ A in the diagram 



X Xy A- 
9' 

X 



Y 



is a monomorphism. 

Proof. To show that is a monomorphism, we take an arbitrary set B and two maps 
m, n: B ^ X xy A such that f^om = /'on, denote that function by p := f'om\ B ^ A. 
Now let = o m and r = g' o n. The diagram looks like this: 




We have that 

f^q = f^g'^ra = gof'om = gof'on = fog'on = for 

But we assumed that / is a monomorphism so this implies that q = r. By the universal 
property of pullbacks. Lemma 2.5.1.14, we have m = n. 

□ 

Exercise 2.7.5.6. Show, in analogy to Proposition 2.7.5.5, that pushouts preserve epi- 
morphisms. 

Example 2.7.5.7. Suppose an olog has a fiber product square 



X xzY- 
f 

^ 

X 



Y 



such that / is intended to be an injection and g is any map. In this case, there are 
nice labeling systems for /^^', and X xzY. Namely: 



^Of course, this diagram is symmetrical, so the same ideas hold if g is an injection and / is any map. 
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• "is" is an appropriate label for 

• the label for g is an appropriate label for g'^ 

• (the label for X, then "which", then the label for g, then the label for Y) is an 
appropriate label for X Y. 

To give an explicit example, 



XxzY 



a rib which is 
made by a cow 



is made by 



a cow 



Y 
X 

a rib 



is made by 



an animal 



Corollary 2.7.5.8. Let i: A ^ X be a monomorphism. Then there is a fiber product 
square of the form 




(2.44) 



Proof. Let X' X denote the image of i and let f : X ^ Q denote the characteristic 
function of X' ^ X . Then it is easy to check that Diagram 2.44 is a pullback. 

□ 

Exercise 2.7.5.9. Consider the subobject classifier 1], the singleton {©} and the map 
{©} "^^^^> Q from Definition 2.7.4.9. Look at diagram 2.44 and in the spirit of Exercise 
2.7.5.7, come up with a label for a label for {©}, and a label for True. Given a label 
for X and a label for /, come up with a label for A, a label for i and a label for such 
that the English smoothly fits the mathematics. 



2.7.6 Multisets, relative sets, and set-indexed sets 

In this section we prepare ourselves for considering categories other than Set, by looking 
at some categories related to Set. 



2.7.6.1 Multisets 

Consider the set X of words in a given document. If WC{X) is the wordcount of the 
document, we will not generally have WC{X) = \X\. The reason is that a set cannot 
contain the same element more than once, so words like "the" might be undercounted in 
\X\. A multiset is a set in which elements can be assigned a multiplicity, i.e. a number 
of times they are to be counted. 

But if X and Y are multisets, what is the appropriate type of mapping from X 
to Y7 Since every set is a multiset (in which each element has multiplicity 1), let's 
restrict ourselves to notions of mapping that agree with the usual one on sets. That 
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is, if multisets X and Y happen to be sets then our mappings X ^ Y should just be 
functions. 

Exercise 2.7.6.2. a.) Come up with some notion of mapping for multisets that generalizes 
functions when the notion is restricted to sets. Suppose that X = (1,1,2,3) and 
Y = (a, 6, 6, 6), i.e. X = {1, 2, 3} with 1 having multiplicity 2, and Y = {a, b} with b 
having multiplicity 3. 

b.) What are all the maps X ^ Y in your notion? 



In Chapter 4 we will be getting to the definition of category, and you can test whether 
your notion of mapping in fact defines a category. Here is my definition of mapping for 
multisets. 

Definition 2.7.6.3. A multiset is a sequence X := (E^B^ir) where E and B are sets 
and 7t: E ^ B is Si surjective function. We refer to E as the set of element instances 
of X^ we refer to B as the set of element names of X^ and we refer to tt as the naming 
function for X. Given an element name x e B^ let 7r~^{x) E he the preimage; the 
number of elements in 7r~^{x) is called the multiplicity of x. 

Suppose that X = (E^B^n) and X' = {E'^B'^n') are multisets. A mapping from 
X to Y ^ denoted f:X F, consists of a pair (/i,/o) such that fi: E E' and 
fo'. B ^ B' are functions and such that the following diagram commutes: 

E — ^ E' (2.45) 

Y i 
B ^B\ 

fo 

Exercise 2.7.6A. Suppose that a pseudo-multiset is defined to be almost the same as a 
multiset, except that tt is not required to be surjective. Write down a pseudo- multiset 
that is not a multi-set. 

a. ) Describe the difference between the two notions in terms of multiplicities. 

b. ) Complexity of names aside, which do you think is a more useful notion: multiset or 

pseudo- multisets? 



Exercise 2.7.6.5. Consider the multisets described in Exercise 2.7.6.2. 

a. ) Write each of them in the form (£^,5,7r), as in Definition 2.7.6.3. 

b. ) In terms of the same definition, what are the mappings X Y7 

c. ) If we remove the restriction that diagram 2.45 must commute, how many mappings 

X ^ Y are there? 



2.7.6.6 Relative sets 

Let's continue with our ideas from multisets, but now suppose that we have a fixed set B 
of names that we want to keep once and for all. Whenever someone discusses a set, each 
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element must have a name in B. And whenever someone discusses a mapping, it must 
preserve the names. For example, if B is the set of English words, then every document 
consists of an ordered set mapping to B (e.g. 1 ^ Suppose, 2 ^ that, 3 ^ we, etc.) A 
mapping from document A to document B would send each word found somewhere in 
A to the same word found somewhere in B. This notion is defined carefully below. 

Definition 2.7.6.7 (Relative set). Let 5 be a set. A relative set over B^ or simply a 
set over B, is a pair {E^ tt) such that £^ is a set and n: E ^ B is di function. A mapping 
of relative sets over B^ denoted /: (E^tt) {E'^tt')^ is a function /: E ^ E' such that 
the triangle below commutes, i.e. tt = tt' o /, 

E ^-^E' 



B 

Exercise 2.7.6.8. Given sets X, F, Z and functions f : X ^ Y and ^: F ^ Z, we can 
compose them to get a function X ^ Z. If 5 is a set, if (X,p), (Y^q)^ and (Z, r) are 
relative sets over 5, and if /: (X,p) iX^q) and g: (Y^q) (Z, r) are mappings, is 
there a reasonable notion of composition such that we get a mapping of relative sets 
(X,p) {Z^r)l Hint: draw diagrams. 
Exercise 2.7.6.9. a.) Let {©} denote a set with one element. What is the difference 
between sets over {©} and simply sets? 

b.) Describe the sets relative to 0. How many are there? 





2.7.6.10 Indexed sets 

Let ^4 be a set. Suppose we want to assign to each element a g A a set This is called 
an A- indexed set. In category theory we are always interested in the legal mappings 
between two different structures of the same sort, so we need a notion of A-indexed 
mappings; we do the "obvious thing". 

Example 2.7.6.11. Let C be a set of classrooms. For each c g C let Pc denote the set of 
people in classroom c, and let Sc denote the set of seats (chairs) in classroom c. Then P 
and 5* are C-indexed sets. The appropriate kind of mapping between them respects the 
indexes. That is, a mapping of multi-sets P ^ S should, for each classroom c g C, be a 
function Pc^ Sc-^^ 

Definition 2.7.6.12. Let A be a set. An A-indexed set is a collection of sets Sa-, one for 
each element a g A; for now we denote this by {Sa)aeA' If {S'^)aeA is another A-indexed 
set, a mapping of A-indexed sets from {Sa)aeA to {S'^)aEA^ denoted 

{fa)aEA'' {Sa)aEA ~^ {S^)aEA 

is a collection of functions fa'. Sa ^ S^^, one for each element a e A. 

Exercise 2.7.6.13. Let {©} denote a one element set. What are {©}-indexed sets and 
mappings between them? 
Exercise 2.7.6.14. There is a strong relationship between A-indexed sets and relative sets 
over A. What is it? 



■"^^If we wanted to allow people from any classroom to choose a chair from just any classroom, category 
theory would tell us to reconsider P and S as sets, forgetting their indices. See Section 5.1.4.6. 



Chapter 3 



Categories and functors, 
without admitting it 

In this chapter we begin to use our understanding of sets to build more interesting 
mathematical devices, each of which organizes our understanding of a certain kind of 
domain. For example, monoids organize our thoughts about agents acting on objects; 
groups are monoids except restricted to only allow agents to act reversibly. We will 
then study graphs, which are systems of nodes and arrows that can capture ideas like 
information flow through a network or model connections between building blocks in a 
material. We will discuss orders, which can be used to study taxonomies or hierarchies. 
Finally we take a mathematical look at databases, which actually subsume everything 
else in the chapter. Databases are connection patterns for structuring information. 

We will see in Chapter 4 that everything we study in the present chapter is an example 
of a category. So is Set, the category of sets studied in Chapter 2. One way to think of 
a category is as a set of objects and a connection pattern between them; sets are objects 
(ovals full of dots if you wish) connected by functions. But each set is itself a category: 
the objects inside it are just disconnected! Just like a set has an interior view and an 
exterior view, so will all the categories in this chapter. Each monoid is a category, but 
there is also a category o/ monoids. 

However, we will not really say the word "category" much if at all in this chapter. 
It seems preferable to let the ideas rise on their own accord as interesting structures in 
their own right before explaining that everything in site fits into a single framework. 
That will be the pleasant reward to come in Chapter 4. 

3.1 Monoids 

A common way to interpret phenomena we see around us is to say that agents are acting 
on objects. For example, in a computer drawing program, the user acts on the canvas 
in certain prescribed ways. Choices of actions from an available list can be performed in 
sequence to transform one image into another. As another example, one might investigate 
the notion that time acts on the position of hands on a clock in a prescribed way. A 
first rule for actions is this: the performance of a sequence of several actions is itself the 
performance of an action — a more complex action, but an action nonetheless. 

Mathematical objects called monoids and groups are tasked with encoding the agent's 
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perspective in all this, i.e. what the agent can do, and what happens when different 
actions are done in succession. A monoid can be construed as a set of actions, together 
with a formula that encodes how a sequence of actions is itself considered an action. A 
group is the same as a monoid, except that every action is required to be reversible. In 
this section we concentrate on monoids; we will get to groups in Section 3.2. 

3.1.1 Definition and examples 

Definition 3.1.1.1 (Monoid). A monoid is a sequence (M, id, ★), where M is a set, 
id G M is an element, and ★iMxM Misa function, such that the following 
conditions hold for all m^n^p e M: 

• m ★ id = m, 

• id ★ m = m, and 

• {m n) p = m {n p) . 

We refer to id as the identity element and to ★ as the multiplication formula for the 
monoid. ^ We call the first two rules identity laws and the third rule the associativity 
law for monoids. 

Remark 3.1.1.2. To be pedantic, the conditions from Definition 3.1.1.1 should be stated 

• ★(m, id) = m, 

• ★(id, m) = m, and 

• ★(★(m, n),p) = ★(m, (★(?!, p)). 

The way they are written in Definition 3.1.1.1 is called infix notation^ and we often use 
infix notation without mentioning it. That is, given a function • : A x 5 ^ C, we may 
write a • b rather than -(a, b). 

Example 3.1.1.3 (Additive monoid of natural numbers). Let M = M be the set of natural 
numbers. Let id = and let M x M ^ M denote addition, so that ^(4, 18) = 22. 
Then the equations m ★ = m and ★ m = m hold, and (m ★ n) ★ p = m ★ (n ★ p). By 
assigning id and ★ in this way, we have "given M the structure of a monoid". 

Remark 3.1.1.4. Sometimes we are working with a monoid (M, id, ★), and the identity 
id and multiplication ★ are somehow clear from context. In this case we might refer to 
the set M as though it were the whole monoid. For example, if we were discussing the 
monoid from Example 3.1.1.3, we might refer to it as M. The danger comes because sets 
may have multiple monoid structures, as we see below in Exercise 3.1.1.5. 

Exercise 3.1.1.5. Let M = M be the set of natural numbers. Taking id = 1, come up 
with a formula for * that gives N the structure of a monoid. 

Exercise 3.1.1.6. In both Example 3.1.1.3 and Exercise 3.1.1.5, the monoids (M, id, ★) 
satisfied an additional rule called commutativity^ namely m*n = n*m for every m^ne M. 
There is a monoid (M, id, ★) lurking in linear algebra textbooks that is not commutative; 
if you have background in linear algebra try to answer this: what M, id, and ★ might I 
be referring to? 

■"^ Although the function M x M ^ M is called the multiplication formula, it may have nothing 
to do with multiplication. It is nothing more than a formula for taking two inputs and returning an 
output; calling it "multiplication" is suggestive of its origins, rather than prescriptive of its behavior. 
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Exercise 3.1.1.7. Recall the notion of commutativity for monoids from Exercise 3.1.1.6. 

a. ) What is the smallest set M that you can give the structure of a non-commutative 

monoid? 

b. ) What is the smallest set M that you can give the structure of a monoid? 



Example 3.1.1.8 (Trivial monoid). There is a monoid with only one element, M = 
({id},id, ★) where {id} x {id} {id} is the unique function. We call this monoid 
the trivial monoid^ and sometimes denote it 1. 

3.1.1.9 Free monoids and finitely presented monoids 

Definition 3.1.1.10. Let X be a set. A list in X is a pair (n, /) where n g M is a natural 
number (called the length of the list) and / : n ^ X is a function, where n = {1, 2, . . . , n}. 
We may denote such a list by 

(n,/) = [/(l),/(2),...,/(n)]. 

The empty list is the unique list in which n = 0; we may denote it by [ ]. Given an 
element x e X the singleton list on x is the list [x]. Given a list L = (n, /) and a number 
i eN with i ^ n, the ith entry of L is the element f{i) e X. 

Given two lists L = {n^ f) and L' = {n\f')^ define the concatenation of L and L'^ 
denoted L to be the list {n -\- n' ^ f f')^ where / ++ f : n -\- n' ^ X is given on 

i + n' hy 

f{i) if i ^ n 

f\i -n) if z ^ n + 1 



Example 3.1.1.11. Let X = {a, 6, c, . . . , z). The following are elements of List(X): 

[a,6,c], [p], [p,a,a,a,p], [], ... 

The concatenation of [a, 6, c] and [p, a, a, a,p] is [a, 6, c,p, a, a, a, a,p]. The concatenation 
of any list A with [ ] is just A. 

Definition 3.1.1.12. Let X be a set. The free monoid generated by X is the sequence 
M := (List(X), [ ], where List(X) is the set of ordered lists of elements in X, where 
[ ] G List(X) is the empty list, and where -H- is the operation of list concatenation. We 
refer to X as the set of generators for the monoid M. 

Exercise 3.1.1.13. Let {©} denote a one-element set. 

a. ) What is the free monoid generated by {©}? 

b. ) What is the free monoid generated by 0? 



In the definition below, we will define a monoid M by specifying some generators 
and some relations. Lists of generators provide us all the possible ways to write elements 
of M. The relations allow us to have two such ways of writing the same element. The 
following definition is a bit dense, so see Example 3.1.1.16 for a concrete example. 
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Definition 3.1.1.14 (Presented monoid). Let G be a finite set, let n g M be a natural 
number, ^ and for each 1 ^ z ^ n, let and m[ be elements of List(G). ^ The 
monoid presented by generators G and relations {(mi,m^) | 1 ^ i ^ n} is the monoid 
(M, id, ★) defined as follows. Let ~ denote the equivalence relation on List(G) generated 
by {{xniiy ~ xm[y) \ x^y e List(G), 1 ^ i ^ n}, and define M = List(G)/ ~. Let id = [ ] 
and let a * 6 be obtained by concatenating representing lists. 

Remark 3.1.1.15. Every free monoid is a presented monoid, because we can just take the 
set of relations to be empty. 

Example 3.1.1.16. Let G = {a,6, c, d}. Think of these as buttons that can be pressed. 
The free monoid List(G) is the set of all ways of pressing buttons, e.g. pressing a then a 
then c then c then d corresponds to the list [a, a, c, c, d]. The idea of presented monoids 
is that you notice that pressing [a,a,c] always gives the same result as pressing [d^d]. 
You also notice that pressing [c, a, c, a] is the same thing as doing nothing. 

In this case, we would have mi = [a, a, c], m[ = [d^ d]^ and m2 = [c, a, c, a], m2 = [ ] 
and relations {(mi,m'^), (m2,m2)}. Really this means that we're equating mi with m[ 
and m2 with m2, which for convenience we'll write out: 

[a, a, c] = [d, d] and [a, c, a, c] = [ ] 

To see how this plays out, we give an example of a calculation in M = List(G)/ ~. 
Namely, 

[6, c, 6, a, c, a, a, c, (i] = [6, c, 6, a, a, c, a, c, a, a, c, (i] = [6, c, 6, a, a, a, c, (i] 

= [6, c, 6, a, d]. 

Application 3.1.1.17 (Buffer). Let G = {a, 6, c, . . . z}. Suppose we have a buffer of 32 
characters and we want to consider the set of lists of length at most 32 to be a monoid. 
We simply have to decide what happens when someone types a list of length more than 
32. 

One option is to say that the last character typed overwrites the 32nd entry, 

[ai, a2, . . . , asi, a32, 6] ~i [ai, a2, . . . , asi, 6]. 

Another option is to say that any character typed after 32 entries is discarded, 

[ai,a2, . . . ,a3i,a32,6] -2 [ai, a2, . . . , aai, a32]. 

Both of these yield finitely presented monoids, generated by G. (In case it's useful, the 
number of necessary relations in both cases is 26^^.) 

00 

Exercise 3.1.1.18. Let's consider the buffer concept again (see Application 3.1.1.17), but 
this time only having size 3 rather than size 32. Show using Definition 3.1.1.14 that with 
relations given by ~i we indeed have [a, 6, c, e, /] = [a, 6, /] and that with relations 
given by ~2 we indeed have [a, 6, c, e, /] = [a, 6, c]. 

Exercise 3.1.1.19. Let K := {BS^ a^b^c^ . . . ^ z}, a set having 27 elements. Suppose you 
want to think of BS g as the "backspace key" and the elements a, 6, . . . z g as the 
letter keys on a keyboard. Then the free monoid List(i^) is not quite appropriate as a 
model because we want [a, 6, BS] = [a, b]. 

^The number n G N is going to stand for the number of relations we declare. 
^Each rrii and are going to be made equal in A4. 
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a. ) Choose a set of relations for which the monoid presented by generators K and the 

chosen relations is appropriate to this application. 

b. ) Under your relations, how does [BS] compare with []? Is that suitable? 



3.1.1.20 Cyclic monoids 

Definition 3.1.1.21. A monoid is called cyclic if it has a presentation involving only 
one generator. 

Example 3.1.1.22. Let Q be a symbol; we look at some cyclic monoids generated by 
{Q}. With no relations the monoid would be the free monoid on one generator, and 
would have underlying set {[ ], [Q], [Q, Q], [Q, Q, Q], • • •}, with identity element [ ] and 
multiplication given by concatenation (e.g. [Q, Q, Q] ++ [Q, Q] = [Q, Q, Q, Q, Q])- This 
is just M, the additive monoid of natural numbers. 

With the really strong relation [Q] ~ [ ] we would get the trivial monoid, a monoid 
having only one element (see Example 3.1.1.8). 

Another possibility is given in the first part of Example 3.1.2.3, where the relation 
[ ] is used, where is shorthand for [Q, Q, Q, Q, Q, Q, Q, Q, Q, Q, Q]. 

Exercise 3.1.1.23 (Classify the cyclic monoids). Classify all the cyclic monoids up to 
isomorphism. That is, come up with a naming system such that every cyclic monoid can 
be given a name in your system, such that no two non-isomorphic cyclic monoids have 
the same name, and such that no name exists in the system unless it refers to a cyclic 
monoid. 

Hint: one might see a pattern in which the three monoids in Example 3.1.1.22 corre- 
spond respectively to oo, 1, and 12, and then think "Cyclic monoids can be classified by 
(i.e. systematically named by elements of) the set N u {oo}." That idea is on the right 
track, but is not correct. 



3.1.2 Monoid actions 

Definition 3.1.2.1 (Monoid action). Let (M, id, ★) be a monoid and let 6* be a set. An 
action of (M, id, ★) on or simply an action of M on S or an M -action on is a 
function 

a iMxS^s 

such that the following conditions hold for all m, n g M and all 5 g S*: 

• id Q s = 5 

• m Q (n Q 5) = (m ★ n) Q 5. 

Remark 3.1.2.2. To be pedantic (and because it's sometimes useful), we may rewrite Q 
SiS a: M X S ^ S and restate the conditions from Definition 3.1.2.1 as 

^ Definition 3.1.2.1 actually defines a left action of (M, id, *) on S. A right action is like a left action 
except the order of operations is somehow reversed. We will not really use right-actions in this text, but 
we briefly define it here for completeness. With notation as above, the only difference is in the second 
condition. We replace it by the condition that for all m,n e M and all s G we have 



m O (n O s) = (n ★ m) O s 
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• a(id, s) = 5, and 

• a(m, a(n, 5)) = a(m ★ n, 5). 

Example 3.1.2.3. Let 5 = {0, 1, 2, ... , 11} and let N = (M, 0, +) be the additive monoid 
of natural numbers (see Example 3.1.1.3). We define a function Q: M x ^ 6* by taking 
a pair (n, s) to the remainder that appears when n + s is divided by 12. For example 
4 O 2 = 6 and 8 Q 9 = 5. This function has the structure of a monoid action because 
the two rules from Definition 3.1.2.1 hold. 

Similarly, let T denote the set of points on a circle, elements of which are denoted by 
a real number in the interval [0, 12), i.e. 

T = {xeR\0 <12} 

and let R = (R, 0, +) denote the additive monoid of real numbers. Then there is an 
action R x T ^ T^ similar to the one above (see Exercise 3.1.2.4). 

One can think of this as an action of the monoid of time on the clock. 

Exercise 3.1.2.4. a.) Realize the set T := [0,12) ^ R as the coequalizer of a pair of 
arrows R ^ R. 

b. ) For any x G R, realize the mapping x • — : T ^ T, implied by Example 3.1.2.3, using 

the universal property of coequalizers. 

c. ) Prove that it is an action. 



Exercise 3.1.2.5. Let B denote the set of buttons (or positions) of a video game controller 
(other than, say 'start' and 'select'), and consider the free monoid List (5) on B. 

a. ) What would it mean for List (5) to act on the set of states of some game? Imagine 

a video game that uses the controller, but for which List (5) would not be said to 
act on the states of G\ Now imagine a simple game G for which List (5) would be 
said to act. 

b. ) Can you think of a state s of G, and two distinct elements i^i' e List(5) such that 

£ G 8 = f G s? In video game parlance, what would you call an element b e B such 
that, for every state s E G, one has b G s = s? 

c. ) In video game parlance, what would you call a state s e S such that, for every 

sequence of buttons £ e List (5), one has £ G s = s7 



Application 3.1.2.6. Let / : R ^ R be a differentiable function of which we want to find 
roots (points x G R such that f{x) = 0. Let e R be a starting point. For any n e N 
we can apply Newton's method to Xn to get 

fjXn) 

This is a monoid (namely M, the free monoid on one generator) acting on a set (namely 
R). 

However, Newton's method can get into trouble. For example at a critical point it 
causes division by 0, and sometimes it can oscillate or overshoot. In these cases we want 
to perturb a bit to the left or right. To have these actions available to us, we would add 
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"perturb" elements to our monoid. Now we have more available actions at any point, 
but the system is less deterministic. 

When publishing an experimental finding, there may be some deep methodological 
questions that are not considered suitably important to mention. For example, one may 
not publish the kind solution finding method (e.g. Newton's method or Runga-Kutta) 
that was used, nor the set of available actions in terms of perturbation. However, these 
may actually infiuence the reproducibility of results. By using the language of monoid 
actions, we can align our data model with our unspoken assumptions about how functions 
are analyzed. 

00 

Remark 3.1.2.7. A monoid is useful for understanding how an agent acts on the set of 
states of an object, but there is only one kind of action. At any point, all actions are 
available. In reality it is often the case that contexts can change and different actions 
are available at different times. For example on a computer, the commands available in 
one application have no meaning in another. This will get us to categories in the next 
chapter. 

3.1.2.8 Monoids actions as ologs 

If monoids are understood in terms of how they act on sets, then it is reasonable to think 
of them in terms of ologs. In fact, the ologs associated to monoids are precisely those 
ologs that have exactly one type (and possibly many arrows and commutative diagrams). 

Example 3.1.2.9. In this example we show how to associate an olog to a monoid action. 
Consider the monoid M generated by the set {u^d^r}, standing for "up, down, right", 
and subject to the relations 

[u, d]^[], [d, ii] - [ ], [u, r] = [r, u], and [d, r] = [r, d]. 

We might imagine that M acts on the set of positions for a character in an old video 
game. In that case the olog corresponding to this action should look something like the 
following: 

Given x, a character position, consider tine following. We know that x is a 
character position, which when moved up results in a character position, which 
when moved down results in a character position that we'll call P(x). We also 
know that x is a character position that we'll call Q(x). Fact: whenever x is a 
character position we will have P(x)=Q(x). Summary: [up, down] = [ ] 

Given x, a character position, consider the following. We know that x is a 
character position, which when moved down results in a character position, 
which when moved up results in a character position that we'll call P(x). We also 
know that x is a character position that we'll call Q(x). Fact: whenever x is a 
character position we will have P(x)=Q(x). Summary: [down, up] = [ ] 

Given x, a character position, consider the following. We know that x is a 
character position, which when moved up results in a character position, which 
when moved right results in a character position that we'll call P(x). We also 
know that x is a character position, which when moved right results in a 
character position, which when moved up results in a character position that 
we'll call Q(x). Fact: whenever x is a character position we will have P(x)=Q(x). 
Summary: [up, right] = [right, up] 

Given x, a character position, consider the following. We know that x is a 
character position, which when moved down results in a character position, 
which when moved right results in a character position that we'll call P(x). We 
also know that x is a character position, which when moved right results in a 
character position, which when moved down results in a character position that 
we'll call Q(x). Fact: whenever x is a character position we will have P(x)=Q(x). 
Summary: [down, right] = [right, down] 



when moved up 
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3.1.2.10 Finite state machines 

According to Wikipedia, a deterministic finite state machine is a quintuple (S, 5, 5o, F), 
where 

1. S is a finite non-empty set of symbols, called the input alphabet, 

2. 5 is a finite, non-empty set, called the state set, 

3. S : X S ^ S is 8i function, called the state-transition function, and 

4. 5o e 5* is an element, called the initial state, 

5. F ^ is a subset, called the set of final states. 

In this book we will not worry about the initial state and the set of final states, 
concerning ourselves more with the interaction via S of the alphabet S on the set S of 
states. 



Figure 3.1: A finite state machine with alphabet S = {a^b} and state set S = 
{State 0, State 1, State 2}. If pressed, we will make State the initial state and {State 
2} the set of final states. 

The following proposition expresses the notion of finite state automata in terms of 
free monoids and their actions on finite sets. 

Proposition 3.1.2.11. Let S, S be finite non-empty sets. Giving a function 6: T^x S ^ 
S is equivalent to giving an action of the free monoid List(S) on S. 

Proof. By Definition 3.1.2.1, we know that function e: List(S) x S ^ S constitutes an 
action of the monoid List(i;) on the set S if and only if, for all s G we have e([ ], s) = s, 
and for any two elements m, E List(i;) we have e{m,e{m' ,s)) = e(m ★ m' ,s), where 
m' is the concatenation of lists. Let 




b 



A = {e: List(E) x S ^ S \ e constitutes an action}. 



We need to prove that there is an isomorphism of sets 



(/>: AiHomset(S x S,S). 
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Given an element e : List(E) x S S in define (j){e) on an element (a, 5) g S x 5 by 
^(e)(cr, s) := e([cr],5), where [a] is the one-element list. We now define ip: Homset(S x 
S, S) A. 

Given an element / g Homset(S x S^S)^ define List(i;) x 6* ^ 5* on a pair 

G List(i;) X where L = [ei,...,en] as follows. By induction, if n = 0, put 
s) = s;ifn^ 1, let = [ei, . . . , e^-i] and put s) = ^{f){L\ /(e,, s)). 

One checks easily that ^p{f) satisfies the two rules above, making it an action of List(I]) 
on S. ^ It is also easy to check that (j) and ^|J are mutually inverse, completing the proof. 

□ 

We sum up the idea of this section as follows: 
Slogan 3.1.2.12. 

" A finite state macliine is an action of a free monoid on a finite set. " 



Exercise 3.1.2.13. Consider the functions and ip above. 

a. ) Show that for any f'.T^xS^S^ the map ip{f): List(I]) x S ^ S constitutes an 

action. 

b. ) Show that (j) and i/j are mutually inverse functions (i.e. (f) o i/j = idHom(Sx5,5) and 

^l)0(j) = id^.) 



3.1.3 Monoid action tables 

Let M be a monoid generated by the set G = {^1, . . . ,^^}, and with some relations, 
and suppose that a: M x ^ is an action of M on a set S = {si, . . . , 5^}. We can 
represent the action a using an action table whose columns are the elements of G and 
whose rows are the elements of 5*. In each cell {row^col)^ where row g S and col g G, 
we put the element a(co/, row) g S. 

Example 3.1.3.1. If S and S are the sets from Figure 3.1, the displayed action of List(I]) 
on S would be given by the action table 



Action from 3.1 


ID 


a 


b 


State 


State 1 


State 2 


State 1 


State 2 


State 1 


State 2 


State 


State 



Example 3.1.3.2. Every monoid acts on itself by its multiplication formula, M x M ^ M. 
If G is a generating set for M, we can write the elements of G as the columns and the 
elements of M as rows, and call this a multiplication table. For example, let (M, 1,*) 
denote the multiplicative monoid of natural numbers. The multiplication table is as 



^This is technically a right-action. 
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follows: 



Multiplication of natural numbers 


N 


1 


2 


3 


4 


5 
























1 


1 


2 


3 


4 


5 




2 


2 


4 


6 


8 


10 




3 


3 


6 


9 


12 


15 




4 


4 


8 


12 


16 


20 


















21 


21 


42 


63 


84 


105 



















(3.2) 



Try to understand what is meant by this: "applying column 2 and then column 2 returns 
the same thing as applyng column 4." 

Exercise 3.1.3.3. Let N be the additive monoid of natural numbers, let S = {0,1,2,... ,11}, 
and let • : Mx 6* ^ /S be the action given in Example 3.1.2.3. Using a nice small generating 
set for the monoid, write out the corresponding action table. 

3.1.4 Monoid homomorphisms 

A monoid (M, id, ★) involves a set, an identity element, and a multiplication formula. For 
two monoids to be comparable, their sets, their identity elements, and their multiplication 
formulas should be appropriately comparable. For example the additive monoids M and 
Z should be comparable because M ^ Z is a subset, the identity elements in both cases 
are the same id = 0, and the multiplication formulas are both integer addition. 

Definition 3.1.4.1. Let M := (M, id, ★) and M' := (M',id',*0 be monoids. A monoid 
homomorphism f from Ad to denoted f : Ai ^ is a function f : M ^ M' 
satisfying two conditions: 

• /(id) = id', and 

• /(^i * ^2) = /(^i) f{'^2)i for all mi, m2 e M. 

The set of monoid homomorphisms from Ai to Ai' is denoted HomMon(A^, 

Example 3.1.4.2 (From M to Z). As stated above, the inclusion map z: M ^ Z induces a 
monoid homomorphism (M, 0, +) (Z, 0, +) because i(0) = and i{ni + 77,2) = ^(^1) + 

Let ^5 : M ^ Z denote the function i^{n) = 5 * n, so ^5(4) = 20. This is also a monoid 
homomorphism because ^5(0) = 5*0 = and ^5(711+712) = 5* (ni +712) = 5*7ii + 5*722 = 
^5(^1) + ^5(^2)- 

Application 3.1.4.3. Let R = {a^c^g^u} and let T = R^, the set of triplets in R. Let 
IZ = List(i?) be the free monoid on R and let T = List(T) denote the free monoid on 
T. There is a monoid homomorphism F: T ^ IZ given by sending t = (ri, r2, ra) to the 
list [ri,r2,r3]. 

If A be the set of amino acids and A = List (A) the free monoid on A, the process 
of translation gives a monoid homomorphism G : T ^ turning a list of RNA triplets 
into a polypeptide. But how do we go from a list of RNA nucleotides to a polypeptide? 
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The answer is that there is no good way to do this mathematicahy. So what is going 
wrong? 

The answer is that there should not be a monoid homomorphism 7Z ^ A because not 
ah sequences of nucleotides produce a polypeptide; for example if the sequence has only 
two elements, it does not code for a polypeptide. There are several possible remedies to 
this problem. One is to take the image of F, which is a submonoid IZ' ^ IZ. It is not 
hard to see that there is a monoid homomorphism F' : IZ' ^ T, and we can compose it 
with G to get our desired monoid homomorphism G o F' : IZ' ^ A. ^ 

00 

Example 3.1.4.4. Given any monoids Ai there is a unique monoid homomorphism from 
A4 to the trivial monoid 1 (see Example 3.1.1.8). There is also a unique homomorphism 
l^M. These facts together have an upshot: between any two monoids M. and we 
can always construct a homomorphism 

M 1 M' 

which we call the trivial homomorphism Ai Ai' . A morphism Ai Ai' that is not 
trivial is called a nontrivial homomorphism. 

Proposition 3.1.4.5. Let Ai = (Z, 0, +) and Ai' = (N, 0, +). The only monoid homo- 
morphism f : Ai ^ Ai' sends every element m e Z to e N. 

Proof. Let f : Ai ^ Ai' be a monoid homomorphism, and let n = /(I) and n' = /(—I) 
in N. Then we know that since = 1 + (— 1) in Z we must have = /(O) = /(I + (— 1)) 
,/'( 1 ) + /(—I) = n -\- n' e N. But if n ^ 1 then this is impossible, so n = 0. Similarly 
n' = 0. Any element m g Z can be written m = l + l + -- - + lorasm = — 1 + — 1 + -- • + — 1, 
and it is easy to see that /(I) + /(I) + • • • + /(I) = = /(-I) + /(-I) + • • • + /(-I). 
Therefore, /(m) = for all m g Z. 

□ 

Exercise 3.1.4.6. For any m g M let : M ^ Z be the function im{'^) = m ^ n. All such 
functions are monoid homomorphisms. Do any monoid homomorphisms (M, 0, +) 
(Z, 0, +) not come in this way? For example, what about using n^5*n — 1 or n^^n^, 
or some other function? 

Exercise 3.1.4.7. Let Ai := (M, 0, +) be the additive monoid of natural numbers, let A/" = 
(R^0 5 +) be the additive monoid of nonnegative real numbers, and let 7^ := (IR>o, 1, *) 
be the multiplicitive monoid of positive real numbers. Can you think of any nontrivial 
monoid homomorphisms of the following sorts: 

Ai^^, Ai^V, M^V, M^Ai, V^Ml 



3.1.4.8 Homomorphisms from free monoids 

Recall that (M, 0, +) is the free monoid on one generator. It turns out that for any other 
monoid Ai = (M, id, ★), the set of monoid homomorphisms M ^ is in bijection with 
the set M. This is a special case (in which G is a set with one element) of the following 
proposition. 



^Adding stop-codons to the mix we can handle more of 7^, e.g. sequences that don't have a multiple- 
of-three many nucleotides. 
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Proposition 3.1.4.9. Let G be a set, let F{G) := (List(G), [ ], ^) be the free monoid 
on G, and let M := (M, id, ★) be any monoid. There is a natural bisection 

HomMon(i^(G), ^ Homset(G,M). 

Proof. We provide a function (j): HomMon(^(G), TW) Homset(G, M) and a function 
\ Homset(G,M) HomMon(^(G), A^) and show that they are mutually inverse. Let 
us first construct (j). Given a monoid homomorphism /: F{G) ^ we need to provide 
(/)(/) : G - M. Given any ^ g G we define ^{f){g) := f{[g]). 

Now let us construct ip. Given p: G ^ M, we need to provide ilj{p) : List(G) M 
such that il){p) is a monoid homomorphism. For a list L = [^i, . . . ^ List(G), define 
ip{p){L) := p{gi) * • • • ^ p{gn) ^ M. In particular, ]) = id. It is not hard to see 

that this is a monoid homomorphism. It is also easy to see that (j) o i/;[p) = p for all 
p e Homset (G,M). We show that o ^(/) = f for ah / e HomMon(^(G), A^). Choose 
L=[gi,...,gn]e List(G). Then 

= W){9i) * • • • * W){9n) = f[gi] * • • • * f[gn] = /(bi, . . . ,^n]) = f{L). 

□ 

Exercise 3.1.4.10. Let G = {a, 6}, let M := (M, id, ★) be any monoid, and let / : G ^ M 
be given by /(a) = m and f{b) = n, where m^n e M. If i/j: Homset (G,M) 
HomMon(^(G), A^) is the function from the proof of Proposition 3.1.4.9 and L = 
[a, a, 6, a, 6], what is tlj{f){L) ? 

3.1.4.11 Restriction of scalars 

A monoid homomorphism f : M ^ M' (see Definition 3.1.4.1) ensures that the elements 
of M have a reasonable interpretation in M'] they act the same way over in M' as 
they did back home in M. If we have such a homomorphism / and we have an action 
a: M' X S ^ S of M' on a set S*, then we have a method for allowing M to act on S as 
well. Namely, we take an element of M, send it over to M', and act on 5*. In terms of 
functions, we compose a with the function (/, id^) : M x S ^ M' x S^io get a function 
we'll denote 

/\f{a): M X S ^ S. 

After Proposition 3.1.4.12 we will know that A/ (a) is indeed a monoid action, and we 
say that it is given by restriction of scalars along f. 

Proposition 3.1.4.12. Let (M, id, ★) and (M',id',V) be monoids, f : M ^ M' a 
monoid homomorphism, S a set, and suppose that a: M' x S ^ S is an action of 
M' on S. Then Af{a) : M x S ^ S, defined as above, is a monoid action as well. 

Proof. Refer to Remark 3.1.2.2; we assume a is a monoid action and want to show that 
A/(a) is too. We have A/(a)(id, 5) = a(/(id),5) = a{id\s) = s. We also have 

A/(a)(m, A/(a)(n, s)) = a{f{m), a{f{n),s)) = a{f{m) /(n), s) 

= a{f{m ★ n), s) 
= Af{a){m ★ n, s). 



□ 
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Example 3.1.4.13. Let M and Z denote the additive monoids of natural numbers and 
integers, respectively, and let M ^ Z be the inclusion, which we saw in Example 
3.1.4.2 is a monoid homomorphism. There is an action a: Z x R ^ R of the monoid 
Z on the set R of real numbers, given by a(n, x) = n -\- x. Clearly, this action works 
just as well if we restrict our scalars to M ^ Z, allowing ourselves only to add natural 
numbers to reals. The action A^a : M x R ^ R is given on (n, x) g M x R by A^a(n, x) = 
a{i{n),x) = a{n^ x) = n + x, just as expected. 

Example 3.1.4.14. Suppose that V is a complex vector space. In particular, this means 
that the monoid C of complex numbers (under multiplication) acts on the elements of V. 
If i : R ^ C is the inclusion of the real line inside C, then i is a monoid homomorphism. 
Restriction of scalars in the above sense turns V into a real vector space, so the name 
"restriction of scalars" is apt. 

Exercise 3.1.4.15. Let M be the free monoid on one generator, let S = {^^^^j, and let 
S = {State 0, State 1, State 2}. Consider the map of monoids / : M ^ List(i;) given by 
sending 1 ^ [a, 6, 6]. The monoid action a: List(i;) x S ^ S given in Example 3.1.3.1 
can be transformed by restriction of scalars along / to an action A/ (a) of M on S. Write 
down its action table. 



3.2 Groups 

Groups are monoids in which every element has an inverse. If we think of these structures 
in terms of how they act on sets, the difference between groups and monoids is that the 
action of every group element can be undone. One way of thinking about groups is in 
terms of symmetries. For example, the rotations and reflections of a square form a group. 

Another way to think of the difference between monoids and groups is in terms of 
time. Monoids are likely useful in thinking about diffusion, in which time plays a role and 
things cannot be undone. Groups are more likely useful in thinking about mechanics, 
where actions are time-reversible. 

3.2.1 Definition and examples 

Definition 3.2.1.1. Let (M, id, ★) be a monoid. An element m g M is said to have 
an inverse if there exists an g M such that mm' = id and m'm = id. A group is a 
monoid (M, id, ★) in which every element m e M has an inverse. 

Proposition 3.2.1.2. Suppose that (M, id, ★) is a group and let m e M be an element. 
Then m has only one inverse. 

Proof. Suppose that both m' and m'^ are inverses of m; we want to show that m' = m" . 
This follows by the associative law for monoids: 

m = m {mm") = {m'm)m" = m" . 

□ 

Example 3.2.1.3. The additive monoid (M, 0, +) is not a group because none of its ele- 
ments are invert ible, except for 0. However, the monoid of integers (Z, 0, +) is a group. 
The monoid of clock positions from Example 3.1.1.22 is also a group. For example the 
inverse of is because = id = -k . 
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Example 3.2.1.4. Consider a square centered at the origin in R^. It has rotational and 
mirror symmetries. There are eight of these, which we denote 

{id, p, p^ p^, ^, ^p, ^p^}, 

where p stands for 90° counterclockwise rotation and <p stands for horizontal-flip (across 
the vertical axis). So relations include = id, 0^ = id, and p^cj) = (pp. 

Example 3.2.1.5. The set of 3 x 3 matrices can be given the structure of a monoid, where 
the identity element is the 3x3 identity matrix, the multiplication is matrix multiplica- 
tion. The subset of invertible matrices forms a group, called the general linear group of 
dimension 3 and denoted GL3. Inside of GL3 is the so-called orthogonal group, denoted 
O3, of matrices M such that = . These matrices correspond to symmetries of 

the sphere centered at the origin. 

Another interesting group is the Euclidean group £^(3) which consists of all isometries 
of R^, i.e. all functions R^ R^ that preserve distances. 

Application 3.2.1.6. In crystallography one is often concerned with the symmetries that 
arise in the arrangement A of atoms in a molecule. To think about symmetries in terms of 
groups, we first define an atom- arrangement to be a finite subset z : A ^ R^. A symmetry 
in this case is an isometry of R^ (see Example 3.2.1.5), say /: R^ ^ R^ such that there 
exists a dotted arrow making the diagram below commute: 

A- - ^A 

i i 

R3 ^R^ 

/ 

That is, it's an isometry of R^ such that each atom of A is sent to a position currently 
occupied by an atom of A. It is not hard to show that the set of such isometries forms 
a group, called the space group of the crystal. 

00 

Exercise 3.2.1.7. Let 5 be a finite set. A permutation of S is an isomorphism /: S ^ S. 

a. ) Come up with an identity, and a multiplication formula, such that the set of permu- 

tations of S forms a monoid. 

b. ) Is it a group? 



Exercise 3.2.1.8. In Exercise 3.1.1.23 you classified the cyclic monoids. Which of them 
are groups? 

Definition 3.2.1.9 (Group action). Let (G,id,*) be a group and S a set. An action of 
G on 5 is a function C^: G x S ^ S such that for all s g 5* and g, g' e G, we have 

• id Q 5 = 5 and 

• g {g' s) = {g-k g') q s. 

In other words, considering G as a monoid, it is an action in the sense of Definition 
3.1.2.1. 
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Example 3.2.1.10. When a group acts on a set, it has the character of symmetry. For 
example, consider the group whose elements are angles 0. This group may be denoted 
U{1) and is often formalized as the unit circle in C of complex numbers z = a + hi 
such that \z\ = o? -\- iP' = 1. The set of such points is given the structure of a group 
([/(I), id, ★) by defining the identity element to be id := 1 + Oi and the group law to be 
complex multiplication. But for those unfamiliar with complex numbers, this is simply 
angle addition where we understand that 360° =0°. If l9i = 190° and O2 = 278°, then 
Oi* O2 = 468 = 108. In the language of complex numbes, z = e'^^ . 

The group U{1) acts on any set that we can picture as having rotational symmetry 
about a fixed axis, such as the earth around the north-south axis. We will define S = 
{{x^y^z) eR^ \ x'^ -\- y'^ -\- z'^ = 1}, the unit sphere, and understand the rotational action 
of U{1) on S. 

We first show that U{1) acts on by ^ Q (x^y^z) = {xcosO + ysinO^—xsinO + 
ycosO^ z), or with matrix notation as 

/ cos(i9) sin((9) \ / ^ \ 
Of {x,y,z) := i -sm{0) cos{0) ^] \ V ] 

Trigonometric identities ensure that this is indeed an action. 

In terms of action tables, we would need infinitely many columns to express this 
action. Here is a sample 



Action of /7(1) on 




e = 45° 


6» = 90° 


e = 100° 


(0,0,0) 


(0,0,0) 


(0,0,0) 


(0,0,0) 


(1,0,0) 


(.Tl,.71,0) 


(0,1,0) 


(-.17,.98,0) 


(0,1,-4.2) 


(-.71,.71,-4.2) 


(-1,0,-4.2) 


(-.98,-.17,-4.2) 


(3,4,2) 


(4.95,.71,2) 


(-4,3,2) 


(3.42,-3.65,2) 











Finally, we are looking to see that the action preserves length so that if (x, y^ z) e S 
then Q (x, z) e S] this way we will have confirmed that U{1) indeed acts on S. The 
calculation begins by assuming x'^ -\- y'^ -\- z'^ = 1 and checks 

{x cos -\- ysin 0)'^ + {—x sinO -\- y cos 0)'^ -\- z"^ = x'^ -\- y'^ -\- z'^ = 1. 

Exercise 3.2.1.11. Let X be a set and consider the group of permutations of X (see 
Exercise 3.2.1.7), which we will denote J^x- Find a canonical action of T^x on X. 

Definition 3.2.1.12. Let G be a group acting on a set X. For any point x g X, the 
orbit of x^ denoted Gx, is the set 

Gx := {x^ E X \ 3g E G such that gx = x^}. 

Application 3.2.1.13. Let S be the surface of the earth, understood as a sphere, and let 
G = U{1) be the group of angles acting on S as in Example 3.2.1.10. The orbit of any 
point p = (x, y^ z) E S is the set of points on the same latitude line as p. 

One may also consider a small band around the earth, i.e. the set A = {{x^y^ z) \ 1.0 ^ 
x'^ + y'^ + z'^ ^ 1.05}. The action of U{1) Q S extends to an action U{1) Q A. The 
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orbits are latitude-lines-at-altitude. A simplifying assumption in climatology may be 
given by assuming that U{1) acts on all currents in the atmosphere in an appropriate 
sense. That way, instead of considering movement within the whole space A, we only 
allow movement that behaves the same way throughout each orbit of the group action. 

00 

Exercise 3.2.1.14. a.) Consider the U{1) action on given in Example 3.2.1.10. De- 
scribe the set of orbits of this action. 

b.) What are the orbits of the action of the permutation group 2,3} the set {1, 2, 3}? 
(See Exercise 3.2.1.11.) 



Exercise 3.2.1.15. Let G be a group and X a set on which G acts by Q: G x X ^ X. 
Is "being in the same orbit" an equivalence relation on X7 

Definition 3.2.1.16. Let G and G' be groups. A group homomorphism f : G ^ G' is 
defined to be a monoid homomorphism G ^ G\ where G and G' are being regarded as 
monoids in accordance with Definition 3.2.1.1. 



3.3 Graphs 

In this course, unless otherwise specified, whenever we speak of graphs we are not talking 
about curves in the plane, such as parabolas, or pictures of functions generally. We are 
speaking of systems of vertices and arrows. 

We will take our graphs to be directed^ meaning that every arrow points from a vertex 
to a vertex; rather than merely connecting vertices, arrows will have direction. If a and 
b are vertices, there can be many arrows from a to 6, or none at all. There can be arrows 
from a to itself. Here is the formal definition in terms of sets and functions. 

3.3.1 Definition and examples 

Definition 3.3.1.1. A graph G consists of a sequence G := (V^ A^ src^tgt) where 

• V is a set, called the set of vertices of G (singular: t'ertea;), 

• A is a set, called the set of arrows of G, 

• src: A ^ V is di function, called the source function for G, and 

• tgt: A ^ V is di function, called the target function for G. 

Given an arrow a e A we refer to src{a) as the source vertex of a and to tgt{a) as the 
target vertex of a. 

To draw a graph, first draw a dot for every element of V . Then for every element 
a E A, draw an arrow connecting dot src{a) to dot tgt{a). 
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Example 3.3.1.2. Here is a picture of a graph G = {V, A, src, tgt): 



G:- 




(3.3) 



We have V 
src, tgt: A - 



= {v^w^x^y^z} and A = {f,g,h,i,j,k}. The source and target functions 
V can be captured in the table to the left below: 



A 


src 


tgt 


/ 


V 


w 


g 


w 


X 


h 


w 


X 


i 


y 


y 


j 


y 


z 


k 


z 


y 



w 



In fact, all of the data of the graph G is captured in the two tables above — together they 
tell us the sets A and V and the functions src and tgt. 

Example 3.3.1.3. Every olog has an underlying graph. The additional information in an 
olog has to do with which pairs of paths are declared equivalent, as well as text that has 
certain English-readability rules. 

Exercise 3.3.1.4. Write down a function f: A ^ where A = {1,2,3,4,5} and B = 

{a,6, c}. Let A u B he the coproduct of A and B (Definition 2.4.2.1) and let A ^ 

Au B ^ B be the two inclusions. Consider the two functions src, tgt: A ^ A ia B, 

where src = ii and tgt is the composition A ^ B Au B. Draw the associated graph 
{A u B,A,src,tgt). 

Exercise 3.3.1.5. a.) Let V be a set. Suppose we just draw the elements of V as vertices 
and have no arrows between them. Is this a graph? 

b.) Given V, is there any other "canonical" or somehow automatic non-random proce- 
dure for generating a graph with those vertices? 



Example 3.3.1.6. Recall from Construction 2.5.2.5 the notion of bipartite graph, which 

we defined to be a span (i.e. pair of functions, see Definition 2.5.2.1) A<^ B. Now 
that we have a formal definition of graph, we might hope that bipartite graphs fit in, 
and they do. Let V = A u B and let i: A ^ V and j : B ^ V he the inclusions. Let 
src = i o f: R ^ V and let tgt = j o g: R ^ V he the composites. 
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Then (V, src^ tgt) is a graph that would be drawn exactly as we specified the drawing 
of spans in Construction 2.5.2.5. 

Example 3.3.1.7. Let G = (V^ A^ src^tgt) be a graph; we want to spread it out over 
discrete time, so that each arrow does not occur within a given time-slice but instead 
over a quantum unit of time. 

Let N = (M, M, n ^ n, n ^ n + 1) be the graph depicted 

1 1 2 2 

• ^ • ^ • ^ • • • 

When we get to limits in a category, we will understand that products can be taken in 
the category of graphs (see Example 4.5.1.5, and N x G will make sense. For now, we 
construct it by hand. 

Let T{G) = (V X M, A X M, src'^ cod') be a new graph, where for a g A and n e M we 
have src\a^ n) := {src{a)^ n) and tgt\a^ n) = (tgt{a)^n + 1). This may be a bit much to 
swallow, so try to simply understand what is being done in the following example. 

Let G be the graph drawn below 



/ 




Then T(G) wih be the graph 



aO /O al fl a2 f2 
• ^ • ^ • 




As you can see, /-arrows still take a's to a's and ^-arrows still take a's to 6's, but they 
always march forward in time. 

Exercise 3.3.1.8. Let G be the graph depicted below: 




Draw (using ellipses • • " if necessary) the graph T{G) defined in Example 3.3.1.7. 
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(y, A, src, tgt) depicted below, 
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(0, 2) ^ (1, 2) ^ (2, 2) ^ • • . 

(0, 1) (1, 1) (2, 1) ^ • • • 

(0,0) ^ (1,0) ^ (2,0) ^ • • • 

a. ) Write down the sets A and E. 

b. ) What are the source and target function A VI 



Exercise 3.3.1.10. A graph is a pair of functions E ^ V. This sets up the notion of 
equahzer and coequahzer (see Definitions 2.5.3.1 and 2.6.3.1). 

a. ) What feature of a graph is captured by the equalizer of its source and target func- 

tions? 

b. ) What feature of a graph is captured by the coequahzer of its source and target 

functions? 



3.3.2 Paths in a graph 

We all know what a path in a graph is. The following definition makes this precise. In 
particular, one can have paths of any finite length n G N, even length or 1. Also, we 
want to be able to talk about the source vertex and target vertex of a path, as well as 
composition of paths. 

Definition 3.3.2.1. Let G = (V^ A^ src^tgt) be a graph. A path of length n in G, 
denoted p e Path[?^ is a head-to-tail sequence 

/ CLl a2 CL3 Cin \ /r, a\ 

P= [VO ^Vl ^V2 ^ ^Vn) (3.4) 

of arrows in G. In particular, Path[^^ = A and Path[?^ = V] we refer to the path of 
length on vertex v as the trivial path on v and denote it by id^ . We denote by Pathc 
the set of all paths on G, 

PathG := y Path[?\ 

nGN 

Every path p E Pathc has a source vertex and a target vertex, and we may denote these 
by src^tgt: Path^ V. If p is a path with src{p) = v and tgt{p) = we may denote 
it hy p: V ^ w. Given two vertices v^w e we write PathG'('^, w) to denote the set of 
all paths p: v ^ w. 
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There is a concatenation operation on paths. Given a path p: v ^ w and q: w ^ 
we define the concatenation, denoted pq: v ^ x in the obvious way. In particular, if p 
(resp. r) is the trivial path on vertex v (resp. vertex w) then for any path q: v ^ we 
have pq = q (resp. qr = q). Thus, for clarity, we may always denote a path as beginning 
with a trivial path on its source vertex; e.g. the path p from Diagram (3.4) may be 
denoted p = id^Qaia2 • • • dn- 

Example 3.3.2.2. In Diagram (3.3), page 79, there are no paths from v io one path 
(/) from V to two paths {fg and fh) from v to x, and infinitely many paths 

{i^'{jky' • •■i^-{jkY- I n,pi,^i, . . . G M} 

from y to y. 

Exercise 3.3.2.3. How many paths are there in the following graph? 

1/2^3 

• ^ • ^ • 



Exercise 3.3.2.4. Let G be a graph and consider the set Path^ of paths in G. Suppose 
someone claimed that there is a monoid structure on the set Pathc, where the multipli- 
cation formula is given by concatenation of paths. Are they correct? Why or why not? 
Hint: what should be the identity element? 



3.3.3 Graph homomorphisms 

A graph (F, A, src, tgt) involves two sets and two functions. For two graphs to be com- 
parable, their two sets and their two functions should be appropriately comparable. 

Definition 3.3.3.1. Let G = {V, A, src.tgt) and G' = {V , A' , src' ,tgt') be graphs. A 
graph homomorphism f from G to G\ denoted /: G G^ consists of two functions 
fo' V ^ and j\ \ A^ A! such that the two diagrams below commute: 



(3.5) 



tgt 



tgt' 



V 



v 



V 



fo 



Remark 3.3.3.2. The above conditions (3.5) may look abstruse at first, but they encode 
a very important idea, roughly stated "arrows are bound to their vertices". Under a map 
of graphs G ^ G' ^ one cannot flippantly send an arrow of G any old arrow of G': it 
must still connect the vertices it connected before. Below is an example of a mapping 
that does not respect this condition: e connects 1 and 2 before, but not after: 



1 o Ih^I', 2i-^2' ai-^a' w o' / 

• ^ • • • ^ • 



The commutativity of the diagrams in (3.5) is exactly what is needed to ensure that 
arrows are handled in the expected way by a proposed graph homomorphism. 
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Example 3.3.3.3. Let G = (V, A, src, tgt) and G' = {V ^ A\ src\ tgt') be the graphs drawn 
to the left and right (respectively) below: 





a 


2 


h 3 






^ • - 


^ • 


•c. 


)' 






• 




5 

• - 


e 6 
^ • 



5 ^ 5^6 ^ 5' 




(3.6) 



The colors indicate our choice of function /o : V . Given that choice, condition (3.5) 
imposes in this case that there is a unique choice of graph homomorphism /: G ^ G' . 

Exercise 3.3.3.4. a.) Where are a, 6, c, e sent under fi: A ^ A' in Diagram (3.6)? 

b.) Choose a couple elements of A and check that they behave as specified by Diagram 
(3.5). 





Exercise 3.3.3.5. Given a morphism of graphs /: G 
Path(/): Path(G) ^ Path(GO. 



G' there an induced function 



a. ) Is it the case that for every n g M, the function Path(/) carries Path(^^(G) to 

Path*^"^^(G'), or can path lengths change in this process? 

b. ) Suppose that /o and /i are inject ive (meaning no two distinct vertices in G are sent 

to the same vertex (respectively for arrows) under /). Does this imply that Path(/) 
is also injective (meaning no two distinct paths are sent to the same path under /)? 

c. ) Suppose that /o and /i are surjective (meaning every vertex in G' and every arrow 

in G' is in the image of /). Does this imply that Path(/) is also surjective? Hint: at 
least one of the answers to these three questions is "no". 



Exercise 3.3.3.6. Given a graph (V, A, src, tgt)^ lei i: A ^ V xV he function guaranteed 
by the universal property for products, as applied to src^tgt: A ^ V. One might hope 
to summarize Condition (3.5) for graph homomorphisms by the commutativity of the 
single square 



A' 



(3.7) 



V xV 



/ox/o 



V X v. 



Is the commutativity of the diagram in (3.7) indeed equivalent to the commutativity of 
the diagrams in (3.5)? 



3.3.3.7 Binary relations and graphs 

Definition 3.3.3.8. Let X be a set. A binary relation on X is a subset R <^ X x X . 

If X = M is the set of integers, then the usual ^ defines a relation on X: given 
(m, n) G M X M, we put (m, n) e R iS m ^ n. As a table it might be written as to the 
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left 



m ^ n 


m 


n 











1 


1 


1 





2 


1 


2 


2 


2 





3 







n = 


5m 


m 


n 








1 


5 


2 


10 


3 


15 


4 


20 


5 


25 


6 


30 







\n — m\ ^ 1 


m 


n 











1 


1 





1 


1 


1 


2 


2 


1 


2 


2 







(3., 



The middle table is the relation {(m, n) g M x M | n = 5m} ^ M x M and the right-hand 
table is the relation {(m, n) g M x M | |n — m| ^ 1} ^ M x M. 

Exercise 3.3.3.9. A relation on R is a subset of R x R, and one can indicate such a subset 
of the plane by shading. Choose an error bound e > and draw the relation one might 
refer to as "e-approximation". To say it another way, draw the relation is within e of 



Exercise 3.3.3.10 (Binary relations to graphs), a.) U R ^ S x S is di binary relation, find 
a natural way to make a graph out of it, having vertices S. 

b. ) What is the set A of arrows? 

c. ) What are the source and target functions src^tgt: A ^ S7 

d. ) Take the left-hand table in (3.8) and consider its first 7 rows (i.e. forget the :). Draw 

the corresponding graph (do you see a tetrahedron?). 

e. ) Do the same for the right-hand table. 



Exercise 3.3.3.11 (Graphs to binary relations), a.) If {V, A^ src^tgt) is a graph, find a 
natural way to make a binary relation R V x V out of it. 

b.) Take the left-hand graph G from (3.6) and write out the corresponding binary relation 
in table form. 



Exercise 3.3.3.12 (Going around the loops), a.) Given a binary relation R ^ S x you 
know from Exercise 3.3.3.10 how to construct a graph out of it, and from Exercise 
3.3.3.11 how to make a new binary relation out of that. How does the resulting 
relation compare with the original? 

b.) Given a graph {V^ A^ src^tgt), you know from Exercise 3.3.3.11 how to make a new 
binary relation out of it, and from Exercise 3.3.3.10 how to construct a new graph 
out of that. How does the resulting graph compare with the original? 





3.4 Orders 



People usually think of certain sets as though they just are ordered, e.g. that an order is 
ordained by God. For example the natural numbers just are ordered. The letters in the 
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alphabet just are ordered. But in fact we put orders on sets, and some are simply more 
commonly used in culture. One could order the letters in the alphabet by frequency of 
use and e would come before a. Given different purposes, we can put different orders 
on the same set. For example in Exercise 4.5.1.4 we will give a different ordering on the 
natural numbers that is useful in elementary number theory. 

In science, we might order the set of materials in two different ways. In the first, we 
consider material A to be "before" material B if A is an ingredient or part of 5, so water 
would be before concrete. But we could also order materials based on how electrically 
conductive they are, whereby concrete would be before water. This section is about 
different kinds of orders. 

3.4.1 Definitions of preorder, partial order, linear order 

Definition 3.4.1.1. Let /S be a set and R S x S Si binary relation on if (s, s') e R 
we will write 5 ^ 5^ Then we say that is a preorder if, for all s^s\s'' e S we have 

Reflexivity: s ^ 5, and 

Transitivity: if 5 ^ and s' ^ s"^ then s ^ s'\ 

We say that is a partial order if it is a preorder and, in addition, for all s^s' e S we 
have 

Antisymmetry: If 5 ^ and s' ^ 5, then s = s'. 

We say that is a linear order if it is a partial order and, in addition, for all 5, 5' g 6* 
we have 

Comparability: Either s ^ or ^ s. 

We denote such a preorder (or partial order or linear order) by (6', ^). 

Exercise 3.4.1.2. a.) Decide whether the table to the left in Display (3.8) constitutes a 
linear order. 

b.) Show that neither of the other tables are even preorders. 



Example 3.4.1.3 (Partial order not linear order). We will draw an olog for playing cards. 




86 CHAPTER 3. CATEGORIES AND FUNCTORS, WITHOUT ADMITTING IT 



We can put a binary relation on the set of boxes here by saying A ^ 5 if there is a 
path A ^ B. One can see immediately that this is a preorder because length=0 paths 
give reflexivity and concatenation of paths gives transitivity. To see that it is a partial 
order we only note that there are no loops. But this partial order is not a linear order 
because there is no path (in either direction) between, e.g., "^a 4 of diamonds^ and "^a 
black queen\ so it violates the comparability condition. 

Remark 3.4.1.4. Note that olog (3.9) in Example 3.4.1.3 is a good olog in the sense that 
given any collection of cards (e.g. choose 45 cards at random from each of 7 decks and 
throw them in a pile), they can be classified according to the boxes of (3.9) such that 
every arrow indeed constitutes a function (which happens to be injective). For example 
the arrow "^a heart^ — ^^-^ "^a red card^ is a function from the set of chosen hearts to the 
set of chosen red cards. 

Example 3.4.1.5 (Preorder not partial order). Every equivalence relation is a preorder 
but rarely are they partial orders. For example if S = {1,2} and we put R = S x then 
this is an equivalence relation. It is a preorder but not a partial order (because 1^2 
and 2^1, but 1 7^ 2, so antisymmetry fails). 

Example 3.4.1.6 ( [Finite linear orders). Let n g N be a natural number. Define a linear 
order on the set {0, 1, 2, . . . , n} in the standard way. Pictorially, 

. -, 1 2 n 

[nj := • ^ • ^ • ^ • • • ^ • 

Every finite linear order, i.e. linear order on a finite set, is of the above form. That 
is, though the labels might change, the picture would be the same. We can make this 
precise when we have a notion of morphism of orders (see Definition 3.4.3.1) 

Exercise 3.4.1.7. Let S = {1,2,3,4}. 

a. ) Find a preorder R <^ S x S such that the set R is as small as possible. Is it a partial 

order? Is it a linear order? 

b. ) Find a preorder R^ <^ S x S such that the set R^ is as large as possible. Is it a partial 

order? Is it a linear order? 



Exercise 3.4.1.8. a.) List all the preorder relations possible on the set {1,2}. 

b. ) For any n E M, how many linear orders exist on the set {1, 2, 3, . . . , n}. 

c. ) Does your formula work when n = 07 



Remark 3.4.1.9. We can draw any preorder (5, ^) as a graph with vertices X and with 
an arrow a ^ b if a ^ b. These are precisely the graphs with the following two properties 
for any vertices a, 6 G S: 

1. there is at most one arrow a ^ b^ and 

2. if there is a path from a to 6 then there is an arrow a ^ b. 

If (6*, ^) is a partial order then the associated graph has an additional "no loops" prop- 
erty, 

3. if n E M is an integer with n ^ 2 then there are no paths of length n that start 
at a and end at a. 
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If (5, ^) is a linear order then there is an additional "comparability" property, 

4. for any two vertices a, b there is an arrow a ^ 6 or an arrow b ^ a. 

Given a graph G, we can create a binary relation ^ on its set 5* of vertices as follows. 
Say a ^ b if there is a path in G from a to b. This relation will be reflexive and transitive, 
so it is a preorder. If the graph satisfies Property 3 then the preorder will be a partial 
order, and if the graph also satisfies Property 4 then the partial order will be a linear 
order. Thus graphs give us a nice way to visualize orders. 

Exercise 3.4.1.10. a.) Let S = {1,2}. The subsets of S form a partial order; draw the 
associated graph. 

b. ) Repeat this for Q = 0, R = {1}, and T = {1, 2, 3}. 

c. ) Do you see n-dimensional cubes? 



Definition 3.4.1.11. Let (5, ^) be a preorder. A clique is a subset S' S such that 
for each a^b e S' one has a ^b. 

Exercise 3.4.1.12. True or false: a partial order is a preorder that has no cliques. (If 
false, is there a "nearby" true statement?) 

Example 3.4.1.13. Let X be a set and ^ X x X a relation. Let 

R' :=Rvj {(x, x) 6 X X X I X 6 X}, 

so that R' is refiexive. Let 

R" := R' u {(x, y) e X X X \ 3z e X such that (x, z) e R' , and (z, y) e R'}, 

so that R'' is both transitive and refiexive, hence a preorder. We call R'^ the preorder 
generated by R. 

Exercise 3.4.1.14. Let X be the set of people and let ^ X x X be the relation with 
(x, y) E R if X is the child of y. Describe the preorder generated by R. 

3.4.2 Meets and joins 

Let X be any set. Recall from Definition 2.7.4.9 that the powerset of X, denoted P(X) is 
the set of subsets of X. There is a natural order on P(X) given by the subset relationship, 
as exemplified in Exercise 3.4.1.10. Given two elements a, 6 g P(X) we can consider them 
as subsets of X and take their intersection as an element of P(X) which we denote a a 6. 
We can also consider them as subsets of X and take their union as an element of P(X) 
which we denote a v b. The intersection and union operations are generalized in the 
following definition. 

Definition 3.4.2.1. Let (S*, ^) be a preorder and let a^b e S he elements. A meet of a 
and b is an element ce S such that 

• c ^ a and c ^b and, 

• for any d E if d ^ a and d ^ b then d ^ c. 
A join of a and b is an element ce S such that 
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• a ^ c and b ^ c and, 

• for any d E if a ^ d and b ^ d then c ^ d. 

That is, the meet of a and b is the biggest thing smaher than both, and the join of a 
and b is the smahest thing bigger than both. 

Exercise 3.4.2.2. Let (S*, ^) be a partial order, and let a^b e S he elements. Suppose that 
c and c' are both meets of a and b. Then c ^ c' and c' ^ c so c = c' by antisymmetry. 

a. ) Will the same argument style hold for joins, to show that any two joins of a and b 

are the same in a partial order? 

b. ) What about in a preorder — can a and b have more than one meet? 



Exercise 3.4.2.3. Consider the partial order from Example 3.4.1.3. 

a. ) What is the join of "^a diamond^ and "^a heart^? 

b. ) What is the meet of "^a black card^ and "^a queen^? 

c. ) What is the meet of "^a diamond^ and "^a card^? 



Not every two elements in a preorder need have a meet, nor need they have a join. 

Exercise 3.4.2.4. a.) Find two elements in the partial order from Example 3.4.1.3 that 
do not have a meet. ^ 

b.) Find two elements that do not have a join (in that preorder). 



Definition 3.4.2.5. Let S := (6', ^) be a preorder. The opposite preorder, denoted 
is the preorder (S*, having the same set of elements but where s 5' iff 5' ^ s. 

Example 3.4.2.6. Recall the preorder JV := (M, divides) from Exercise 4.5.1.4. Then 
AT^P is the set of natural numbers but where m ^ n iff m is a multiple of n. So 6 ^ 2 
and 6^3. 

Exercise 3.4.2.7. Suppose that S := (5, ^) is an preorder. 

a. ) If <S is a partial order, is <S°p also a partial order? 

b. ) If 5 is a linear order, is 5°^ a linear order? 



3.4.3 Morphism of orders 

An order (5, ^), be it a preorder, a partial order, or a linear order, involves a set and a 
binary relations. For two orders to be comparable, their sets and their relations should 
be appropriately comparable. 

Definition 3.4.3.1. Let S := {S, ^) and 5' := (5", ^') be preorders (respectively partial 
orders or linear orders). A morphism of preorders (resp. of partial orders or of linear 
orders) f from S to S\ denoted /: S S', is a function /: S ^ S' such that, for every 
pair of elements 5i,52 e 5, if si ^ 52 then f{si) f{s2). 

^Use the displayed preorder, not any kind of "completion of what's there". 
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Example 3.4.3.2. Let X and Y be sets, let /: X ^ F be a function. Then for every 
subset X' ^ X, its image f{X^) ^ F is a subset (see Section 2.1.2). Thus we have a 
function F: P(X) P(F), given by taking images. This is a morphism of partial orders 
(F(X), ^ (P(Y-), Indeed, if a ^ 6 in P(X) then /(a) c in ¥{Y). 

Application 3.4.3.3. It's often said that "a team is only as strong as its weakest member". 
Is this true for materials? The hypothesis that a material is only as strong as its weakest 
constituent can be understood as follows. 

Recall from the introduction to this section (see 3.4, page 84) that we can put several 
different orders on the set M of materials. One example there was the order given by 
constituency (m if ^ is an ingredient or constituent of m'). Another order is 

given by strength: m ^5 if is stronger than m (in some fixed setting). 

Is it true that if material m is a constituent of material m' then the strength of 
m' is less than or equal to the strength of m? This is the substance of our quote 
above. Mathematically the question would be posed, "is there a morphism of preorders 
(M,^c)^(M, ^^P)?" 

00 

Exercise 3.4.3.4. Let X and Y be sets, let /: X ^ F be a function. Then for every 
subset Y' ^ F, its preimage f~^{Y') ^ X is a subset (see Definition 2.5.1.12). Thus we 
have a function F : P(y) P(X), given by taking preimages. Is it a morphism of partial 
orders? 

Example 3.4.3.5. Let S be a set. The smallest preorder structure that can be put on 
S is to say a ^ 6 iff a = 6. This is indeed refiexive and transitive, and it is called the 
discrete preorder on S. 

The largest preorder structure that can be put on S is to say a ^ 6 for all a^b e S. 
This again is refiexive and transitive, and it is called the indiscrete preorder on S. 

Exercise 3.4.3.6. Let 5 be a set and let (T, ^t) be a preorder. Let be the discrete 
preorder on S. Given a morphism of preorders (S^^d) (^, ^t) we get a function 
S ^T. 

a. ) Which functions S ^ T arise in this way? 

b. ) Given a morphism of preorders (T, ^t) (S^^d)^ we get a function T ^ S. In 

terms of which functions T ^ S arise in this way? 



Exercise 3.4.3.7. Let 5* be a set and let (T, ^t) be a preorder. Let ^/ be the indiscrete 
preorder on S. Given a morphism of preorders (5*, ^/) (T, ^t) we get a function 

a. ) In terms of ^t, which functions S ^ T arise in this way? 

b. ) Given a morphism of preorders (T, ^t) (5*, ^/), we get a function T ^ S. In 

terms of which functions T ^ S arise in this way? 



3.4.4 Other applications 
3.4.4.1 Taxonomy 

Taxonomy is the science of making orders. The most commonly known such taxonomy 
is that of biological organisms. 
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Exercise 3.4.4.2. Should the taxonomy of biological organisms be a preorder, a partial 
order, or a linear order? Can you see any possibility of more than one decent answer? ^ 



Exercise 3.4.4.3. a.) Would you guess that the taxonomic order of biological species has 
all meets? 

b. ) Would you guess it has all joins? 

c. ) What would a meet or a join mean? 



Exercise 3.4.4.4. a.) In your favorite scientific realm, are there any interesting tax- 
onomies (or orders)? 

b.) Choose one; what would meets and joins mean in that setting? 



3.4.4.5 Security 

Security, say of sensitive information, is based on two things: a security clearance and 
"need to know." The former, security clearance might have levels like "confidential", 
"secret", "top secret". But maybe we can throw in "president" and some others too, like 
"plebe". 

Exercise 3.4.4.6. Does it appear that security clearance is a preorder, a partial order, or 
a linear order? 

Need-to-know is another classification of people. For each bit of information, we do 
not necessarily want everyone to know about it, even everyone of the specified clearance. 
It is only disseminated to those that need to know. 

Exercise 3.4.4.7. Let P be the set of all people and let / be the set of all pieces of 
information known by the government. For each subset / ^ /, let ^ P be the set 

of people that need to know every piece of information in /. Let S = {K{I) | / ^ /} 
with the subset relation. 

a. ) Is (P, ^) a preorder? If not, find a nearby preorder. 

b. ) If h ^ h do we have K{Ii) ^ Kih) or K{l2) ^ K{Ii) or neither? 

c. ) Should the preorder (S^^) have all meets? 

d. ) Should it have all joins? 



3.4.4.8 Spaces, e.g. geography 

Consider closed curves that can be drawn in the plane R^, e.g. circles, ellipses, and 
kidney-bean shaped curves. The interiors of these closed curves (not including the bound- 
ary itself) are called basic open sets in M?. The good thing about such an interior U 
is that any point p E U is not on the boundary, so there is a tiny little basic open set 
surrounding p and completely contained in U. In fact, the union of any collection of 

^ Every partial order is a preorder and every linear order is a partial order. Here we want to know 
whether the taxonomy is a preorder-but-not-partial-order, a partial-order-but-not-linear-order, or a lin- 
ear order. 
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basic open sets still has this property. An open set in is any subset U ^M? that can 
be formed as the union of a collection of basic open sets. 

Example 3.4.4.9. Let U = {{x^y) e M? \ x > 0}. To see that U is open, define the 
following sets: for any a, 6 G R, let 5'(a, b) be the square parallel to the axes, with side 
length 1, where the upper left corner is (a, b). Let 5" (a, b) be the interior of 6'(a, b). Then 
each S'{a^b) is open, and U is the union of S'{a^b) over the collection of all a > and 
ah 6, 

U = S'{a,b). 

a, b e R, 
a > 

The idea of open sets extends to spaces beyond R^. For example, on the earth one 
could define a basic open set to be the interior of any region one can "draw a circle 
around" (with a metaphorical pen), and define open sets to be unions of basic open sets. 

Exercise 3.4.4.10. Let S be the set of open subsets on earth, as defined in the above 
paragraph. 

a. ) If ^ is the subset relation, is (S*, ^) a preorder or a partial order? 

b. ) Does it have meets, does it have joins? 



Exercise 3.4.4.11. Let S be the set of open subsets of earth as defined above. To each 
open subset of earth suppose we know the range of recorded temperature throughout s 
(i.e. the low and high throughout the region). Thus to each element s e S we assign an 
interval T{s) := {x \ a ^ x ^b}. If we order the set V of intervals of R by the subset 
relation, it gives a partial order on V. 

a. ) Does our assignment T: S amount to a morphism of orders? 

b. ) Does it preserve meets or joins? (Hint: it doesn't preserve both.) 



Exercise 3.4.4.12. a.) Can you think of a space relevant to your favorite area of science 
for which it makes sense to assign an interval of real numbers to each open set some- 
how, analogously to Exercise 3.4.4.11? For example for a sample of some material 
under stress, perhaps the strain on each open set is somehow an interval? 

b.) Repeat the questions from Exercise 3.4.4.11. 



3.5 Databases: schemas and instances 

The first three sections of this chapter were about classical objects from mathematics. 
The present section is about databases, which are classical objects from computer science. 
These are truly "categories and functors, without admitting it" (see Theorem 4.4.2.3). 

3.5.1 What are databases? 

Data, in particular the set of observations made during experiment, plays ^ a primary 
role in science of any kind. To be useful data must be organized, often in a row-and- 

^The word data is generally considered to be the plural form of the word datum. However, individual 
datum elements are only useful when they are organized into structures (e.g. if one were to shuffle the 
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column display called a table. Columns existing in different tables can refer to the same 
data. 

A database is a collection of tables, each table T of which consists of a set of columns 
and a set of rows. We roughly explain the role of tables, columns, and rows as follows. 
The existence of table T suggests the existence of a fixed methodology for observing 
objects or events of a certain type. Each column c in T prescribes a single kind or 
method of observation, so that the datum inhabiting any cell in column c refers to an 
observation of that kind. Each row r in T has a fixed sourcing event or object, which 
can be observed using the methods prescribed by the columns. The cell (r, c) refers to 
the observation of kind c made on event r. All of the rows in T should refer to uniquely 
identifiable objects or events of a single type, and the name of the table T should refer 
to that type. 

Example 3.5.1.1. When graphene is strained (lengthened by a factor of x ^ 1), it becomes 
stressed (carries a force in the direction of the lengthening). The following is a made-up 
set of data. 



Graphene sample 


ID 


Source 


Stress 


Strain 


A118-1 


C Smkt 








A118-2 


C Smkt 


0.02 


20 


A118-3 


C Smkt 


0.05 


40 


A118-4 


AC 


0.04 


37 


A118-5 


AC 


0.1 


80 


A118-6 


C Plat 


0.1 


82 



Supplier 


ID 


Full name 


Phone 


C Smkt 


Carbon Supermarket 


(541)781-6611 


AC 


Advanced Chemical 


(410) 693-0818 


C Plat 


Carbon Platform 


(510) 719-2857 


McD 


McDonard's Burgers 


(617) 244-4400 


AFP 


Acme Pen and Paper 


(617) 823-5603 



(3.10) 

In the first table, titled "Graphene sample", the rows refer to graphene samples, and 
the table is so named. Each graphene sample can be observed according to the source 
supplier from which it came, the strain that it was subjected to, and the stress that 
it carried. These observations are the columns. In the second table, the rows refer to 
suppliers of various things, and the table is so named. Each supplier can be observed 
according to its full name and its phone number; these are the columns. 

In the left-hand table it appears either that each graphene sample was used only 
once, or that the person recording the data did not keep track of which samples were 
reused. If such details become important later, the lab may want to change the layout 
of the first table by adding on the appropriate column. This can be accomplished using 
morphisms of schemas, which will be discussed in Section 4.4.1. 



3.5.1.2 Primary keys, foreign keys, and data columns 

There is a bit more structure in the above tables (Example 3.10) then may first meet 
the eye. Each table has a primary ID column^ found on the left, as well as some data 
columns and some foreign key columns. The primary key column is tasked with uniquely 
identifying different rows. Each data column houses elementary data of a certain sort. 
Perhaps most interesting from a structural point of view are the foreign key columns, 
because they link one table to another, creating a connection pattern between tables. 

cells in a spreadsheet, most would consider the data to be destroyed). It is the whole organized structure 
that really houses the information; the data must be in formation in order to be useful. Thus I will use 
the word data as a collective noun (akin to the word "sand"); it bridges the divide between the individual 
datum elements (akin to the grains of sand) and the data set (akin to a sand pile). In particular, I will 
often use the word data as a singular noun. 
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Each foreign key column houses data that needs to be further unpacked. It thus refers us 
to another foreign table, in particular the primary ID column of that table. In Example 
3.10 the Source column was a foreign key to the Supplier table. 
Here is another example, lifted from [Sp2]. 

Example 3.5.1.3. Consider the bookkeeping necessary to run a department store. We 
keep track of a set of employees and a set of departments. For each employee e, we keep 
track of 

E.l the first name of e, which is a FirstNameString, 

E.2 the last name of e, which is a LastNameString, 

E.3 the manager of e, which is an Employee, and 

E.4 the department that e works in, which is a Department. 

For each department we keep track of 

D.l the name of which is a DepartmentNameString, and 

D.2 the secretary of which is an Employee. 

Above we can suppose that E.l, E.2, and D.l are data columns (referring to names 
of various sorts), and E.3, E.4, and D.2 are foreign key columns (referring to managers, 
secretaries, etc.). 

Display (3.11) shows how such a database might look at a particular moment in time. 



Employee 


ID 


first 


last 


manager 


worksin 


101 


David 


Hilbert 


103 


qlO 


102 


Bertrand 


Russell 


102 


x02 


103 


Emmy 


Noether 


103 


qlO 



Department 


ID 


name 


secretary 


qlO 


Sales 


101 


x02 


Production 


102 



(3.11) 



3.5.1.4 Business rules 

Looking at the tables from Example 3.5.1.3, one may notice a few patterns. First, every 
employee works in the same department as his or manager. Second, every department's 
secretary works in that department. Perhaps the business counts on these rules for the 
way it structures itself. In that case the database should enforce those rules, i.e. it 
should check that whenever the data is updated, it conforms to the rules: 

Rule 1 For every employee e, the manager of e works in the same department 
that e works in. 

Rule 2 For every department the secretary of d works in department d. 

(3.12) 
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3.5.1.5 Data columns as foreign keys 



To make everything consistent, we could even say that data columns are specific kinds of 
foreign keys. That is, each data column constitutes a foreign key to some non-branching 
leaf table^ which has no additional data. 



Example 3.5.1.6. Consider again Example 3.5.1.3. Note that first names and last names 
had a particular type, which we all but ignored above. We could cease to ignore them 
by adding three tables, as follows. 



FirstNameString 



ID 



Alan 



Alice 



Bertrand 



Carl 



David 



Emmy 



LastNameString 



ID 



Arden 



Hilbert 



Jones 



Noether 



Russell 



DepartmentNameString 



ID 



Marketing 



Production 



Sales 



(3.13) 



In combination. Displays (3.11) and (3.13) form a collection of tables with the prop- 
erty that every column is either a primary key or a foreign key. The notion of data 
column is now subsumed under the notion of foreign key column. Everything is either a 
primary key (one per table, labeled ID) or a foreign key column (everything else). 



3.5.2 Schemas 



The above section may all seem intuitive or reasonable in some ways, but also a bit 
difficult to fully grasp, perhaps. It would be nice to summarize what is happening in a 
picture. Such a picture, which will basically be a graph, should capture the conceptual 
layout to which the data conforms, without yet being concerned with the individual data 
that may populate the tables in this instant. We proceed at first by example, giving the 
precise definition in Definition 3.5.2.6. 



Example 3.5.2.1. In Examples 3.5.1.3 and 3.5.1.6, the conceptual layout for a department 
store was given, and some example tables were shown. We were instructed to keep track 
of employees, departments, and six types of data (E.l, E.2, E.3, E.4, D.l, and D.2), and 
we were instructed to follow two rules (Rule 1, Rule 2). All of this is summarized in the 
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C:= Schema for tables (3.11) and (3.13) conforming to (3.12) 



Employee manager worksin ^ Employee worksin 




Department secretary worksin ^ Department 




manager 




Employee worksin ^ Department 


/ \ secretary 




first \^ last 


name 






FirstNameString LastNameString DepartmentNameString 
• • • 



(3.14) 



The five tables from (3.11) and (3.13) are seen as five vertices; this is also the number of 
primary ID columns. The six foreign key columns from (3.11) and (3.13) are seen as six 
arrows; each points from a table to a foreign table. The two rules from (3.12) are seen as 
statements at the top of Display (3. 14). We will explain path equivalences in Definition 
3.5.2.3. 

Exercise 3.5.2.2. Come up with a schema (consisting of dots and arrows) describing the 
conceptual layout of information presented in Example 3.5.1.1. 

In order to define schemas, we must first define the notion of schematic equivalence 
relation, which is to hold on the set of paths of a graph G (see Section 3.3.2). Such 
an equivalence relation (in addition to being reflexive, symmetric, and transitive) has 
two sorts of additional properties: equivalent paths must have the same source and 
target, and the composition of equivalent paths with other equivalent paths must yield 
equivalent paths. Formally we have Definition 3.5.2.3. 

Definition 3.5.2.3. 

Let G = (V, A, 5rc, tgt) be a graph. A path equivalence declaration (or PED) is a 
phrase of the form p ^ q where p, g e Path^ have the same source and target, src{p) = 
src{q) and tgt{p) = tgt{q). 

A categorical path equivalence relation (or CPER) on G is an equivalence relation ^ 
on Pathc that has the following properties: 

1. li p ^ q then src{p) = src{q). 

2. If p ^ q then tgt{p) = tgt{q). 

3. Suppose p^q \ b ^ c are paths, and m: a ^ 6 is an arrow. If p ^ q then mp ^ mq. 

4. Suppose p^q: a ^ b are paths, and n: 6 ^ c is an arrow. If p q then pn ^ qn. 

Any set of categorical path equivalence declarations generates a categorical path 
equivalence relation (CPER). We tend to elide the difference between CPERs and sets 
of PEDs. 

Exercise 3.5.2.4. Consider the graph shown in (3.14), and the two declarations shown at 
the top. They generate a CPER. 
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a. ) Is it true that under this CPER we have 

Employee manager manager worksin ^ Employee worksin? 

b. ) What about 

Employee worksin secretary ^ Employee? 

c. ) What about 

Department secretary manager worksin name ^ Department name? 



Lemma 3.5.2.5. Suppose that G is a graph and ^ is a CPER on G. Suppose p ^ 
q: a ^ b and r ^ s: b ^ c. Then pr ^ qs. 

Proof. The picture to have in mind is this: 

J- 

• 

Applying condition (3) from Definition 3.5.2.3 to each arrow in path p, it fohows by 
induction that pr ^ ps. Applying condition (4) to each arrow in path 5, it follows 
similarly that ps ^ qs. Because ^ is an equivalence relation, it follows that pr ^ qs. 

□ 

Definition 3.5.2.6. A database schema (or simply schema) C consists of a pair C : = 
(G, ^) where G is a graph and ^ is a categorical path equivalence relation (CPER) on 
G. 

Example 3.5.2.7. The picture drawn in (3.14) has the makings of a schema. Pictured is 
a graph with two PEDs; these generate a CPER, as discussed in Exercise 3.5.2.4. 

A schema can be converted into a system of tables each with a primary key and some 
number of foreign keys referring to other tables, as discussed in Section 3.5.1. Definition 
3.5.2.6 gives a precise conceptual understanding of what a schema is, and the following 
rules describe how to convert such a thing into a table layout. 

Rules of good practice 3.5.2.8. Converting a schema C = (G, ^) into a table layout should 
be done as follows: 

(i) There should be a table for every vertex in G and if the vertex is named, the table 
should have that name; 

(ii) Each table should have a left-most column called ID, set apart from the other 
columns by a double vertical line; and 

(iii) To each arrow a in G having source vertex s := src{a) and target vertex t := tgt{a), 
there should be a foreign key column a in table s, referring to table t; if the arrow 
a is named, column a should have that name. 
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Example 3.5.2.9. [Discrete dynamical system] 
Consider the schema 



Coop : = 



/ 



(3.15) 



in which the CPER is trivial (i.e. generated by the empty set of PEDs.) This schema is 
quite interesting. It encodes a set s and a function f:s^s. Such a thing is called a 
discrete dynamical system. One imagines s as the set of states and, for any state x e 
a notion of "next state" f{x) e s. For example 



s 


ID 


f 


A 


B 


B 


C 


C 


C 


D 


B 


E 


C 


F 


G 


G 


H 


H 


G 



...pictured.. 




(3.16) 



Application 3.5.2.10. Imagine a quantum-time universe in which there are discrete time 
steps. We model it as a discrete dynamical system, i.e. a table of the form (3.16). For 
every possible state of the universe we include a row in the table. The state in the next 
instant is recorded in the second column. 

00 

Example 3.5.2.11 (Finite hierarchy). The schema Coop can also be used to encode hier- 
archies, such as the manager relation from Examples 3.5.1.3 and 3.5.2.1, 



One problem with this, however, is if a schema has even one loop, then it can have in- 
finitely many paths (corresponding, e.g. to an employees manager's manager's manager's 
... manager). 

Sometimes we know that in a given company that process eventually ends, a famous 
example being that at Ben and Jerry's ice cream, there were only seven levels. In that 
case we know that an employee's 8th level manager is equal to his or her 7th level 
manager. This can be encoded by the PED 

E mgr mgr mgr mgr mgr mgr mgr mgr ^ E mgr mgr mgr mgr mgr mgr mgr 

or more concisely, mgr^ = mgr^. 

Exercise 3.5.2.12. Is there any nontrivial PED on Coop that holds for the data in Example 
3.5.2.9? If so, what is it and how many equivalence classes of paths in Coop are there 
after you impose that relation? 

Exercise 3.5.2.13. Let P be a chess-playing program. Given any position (including the 
choice of whose turn it is), P will make a move. 
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a. ) Is this an example of a discrete dynamical system? 

b. ) How do the rules for ending the game in a win or draw play out in this model? (Look 

up online how chess games end if you don't know.) 



3.5.2.14 Ologging schemas 

It should be clear that a database schema is nothing but an olog in disguise. The 
difference is basically the readability requirements for ologs. There is an important new 
addition in this section, namely that we can fill out an olog with data. Conversely, we 
have seen that databases are not any harder to understand than ologs are. 

Example 3.5.2.15. Consider the olog 



a moon 



1 orbits 



a planet 



(3.17) 



We can document some instances of this relationship using the following tables: 



orbits 


a moon 


a planet 


The Moon 


Earth 


Phobos 


Mars 


Deimos 


Mars 


Ganymede 


Jupiter 


Titan 


Saturn 



(3.18) 



Clearly, this table of instances can be updated as more moons are discovered by the 
author (be it by telescope, conversation, or research). 

Exercise 3.5.2.16. In fact. Example 3.5.2.15 did not follow Rules 3.5.2.8. Strictly follow- 
ing those rules, copy over the data from (3.18) into tables that are in accordance with 
schema (3.17). 

Exercise 3.5.2.17. a.) Write down a schema, in terms of the boxes "^a thing I own^ and "^a 
place^ and one additional arrow, that might help one remember where they decided 
to put "random" things. 

b. ) What is a good label for the arrow? 

c. ) Fill in some rows of the corresponding set of tables for your own case. 



Exercise 3.5.2.18. Consider the olog 



a father 



has as first 



a child 



has 



What path equivalence declarations would be appropriate for this olog? You can use 
f:F^C and h: C ^ F if you prefer. 
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3.5.3 Instances 



Given a database schema (G, an instance of it is just a bunch of tables whose data 
conform to the specified layout. These can be seen throughout the previous section, most 
explicitly in the relationship between schema (3.14) and tables (3.11) and (3.13), and 
between schema (3.15) and table (3.16). Below is the mathematical definition. 

Definition 3.5.3.1. Let C = (G, — ) where G = (V^ A^ src^tgt). An instance on C, 
denoted (PK,FK): C Set, consists of some constituents (A. primary ID part, B. 
foreign key part) conforming to a law (1. preservation of CPER), as follows: 

A. a function PK: V Set; i.e. to each vertex v e V we assign a set PK('u);^^ and 

B. for every arrow a E A with v = src{a) and w = tgt{a)^ a function FK(a) : FK{v) 
FK{w). 11 

The following law holds for any vertices v^w and paths p = [ai,a2, . . . , a^] and q = 
[a'l, . . . , a^] from v to w: 

1. If p 0^ q then for all x E PK('u), we have 

FK(a^) o • • . o FK(a2) o FK(ai)(x) = FK{a'J o • • • o FK(4) ^ FK(a;)(x) 

in PK(^). 

Exercise 3.5.3.2. Consider the olog pictured below: 



C :-- 



is sent by 



self-email 



IS 

^ 


an email 




a person 







Given X , a self-email, consider the following. 

We know that x is a self-email, which is an email, 

which is sent by a person that we'll call P{x). 

We also know that x is a self-email, which is an 

email, which is sent to a person that we'll call Q{x). 

Fact: whenever x is a self-email, we will have P{x) = 

Q{x) 



a self -email 


ID 


is 


SEml207 


Eml207 


SEml210 


Eml210 


SEml211 


Eml211 



an email 


ID 


is sent by 


is sent to 


Eml206 


Bob 


Sue 


Eml207 


Carl 


Carl 


Eml208 


Sue 


Martha 


Eml209 


Chris 


Bob 


Eml210 


Chris 


Chris 


Eml211 


Julia 


Julia 


Eml212 


Martha 


Chris 



a person 



ID 



Bob 



Carl 



Chris 



Julia 



Martha 



Sue 



(3.19) 



-'^'-'The elements of PK('u) will be listed as the rows of table v, or more precisely as the leftmost cells 
of these rows. 

-'^-'^The arrow a will correspond to a column, and to each row r G PK('u) the (r, a) cell will contain the 



datum FK(a)(r). 
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a. ) What is the set PKC^an emair)? 

b. ) What is the set PKC^a person^)? 

c. ) What is the function FK(is sent by) : PKC^an email^) PKC^a person^)? 

d. ) Interpret the Englishification mess at the bottom of C as a simple path equivalence 

declaration. Is it satisfied by the instance (3.19); that is, does law 1. from Definition 
3.5.3.1 hold? 



Example 3.5.3.3. [Monoid action table] 

In Example 3.1.2.9, we saw how a monoid Ai could be captured as an olog with only 
one object. As a database schema, this means there is only one table. Every generator 
of Ai would be a column of the table. The notion of database instance for such a schema 
is precisely the notion of action table from Section 3.1.3. 

Exercise 3.5.3.4. Draw (as a graph) the schema for which Table 3.1.3.1 is an instance. 

Exercise 3.5.3.5. Suppose that is a monoid and some instance of it is written out in 
table form. It's possible that 7W is a group. What evidence in an instance table for Ai 
might suggest that is a group? 

3.5.3.6 Paths through a database 

Let C := (G, — ) be a schema and let (PK, FK) : C Set be an instance on C. Then 
for every arrow a: v ^ w in G we get a function FK(a): FK{v) FK{w). Functions 
can be composed, so in fact for every path through G we get a function. Namely, if 
p = [ai , ^2 , . . . , an] is a path from vq to Vn then the instance provides a function 

FK{p) := FK{an) o • • • FK(a2) o FK(ai) : PK(^o) - PK(^n), 

which first made an appearance as part of Law 1 in Definition 3.5.3.1. 

Example 3.5.3.7. Consider the department store schema from Example 3.5.2.1, and in 
(3.14) the path [worksin, secretary, last] which points from Employee to LastNameString. 
The instance will let us interpret this path as a function from the set of employees to 
the set of last names; this could be a useful function to have around. The instance from 
(3.11) would yield the following function 



Employee 


ID 


Seer, name 


101 


Hilbert 


102 


Russell 


103 


Hilbert 



Exercise 3.5.3.8. Consider the pathp := [/, /] on the Coop schema from (3.15). Using the 
instance from (3.16), where PK(s) = {A^B^C^D^E^F^G^H}^ interpret p as a function 
PK(5) PK(s), and write this as a 2-column table, as above in Example 3.5.3.7. 

Exercise 3.5.3.9. a.) Given an instance (PK,FK) on a schema C, and given a trivial 
path p (i.e. p has length 0; it starts at some vertex but doesn't go anywhere), what 
function does p yield? 

b.) What are its domain and codomain? 





Chapter 4 

Basic category theory 



Categories, or an equivalent notion, have already been secretly introduced as ologs. One 
can think of a category as a graph (as in Section 3.3) in which certain paths have been 
declared equivalent. (Ologs demand an extra requirement that everything in sight be 
readable in natural language, and this cannot be part of the mathematical definition 
of category.) The formal definition of category is given in Definition 4.1.1.1, but it 
will not be obviously the same as the "graph+path equivalences" notion; the latter was 
given in Definition 3.5.2.6 as the definition of a schema. Once we talk about how different 
categories can be compared using functors (Definition 4.1.2.1), and how different schemas 
can be compared using schema mappings (Definition 4.4.1.2), we will prove that the two 
notions are equivalent (Theorem 4.4.2.3). 

4.1 Categories and Functors 

In this section we give the standard definition of categories and functors. These, together 
with natural transformations (Section 4.3), form the backbone of category theory. We 
also give some examples. 

4.1.1 Categories 

Definition 4.1.1.1. A category C consists of some constituents (A. objects, B. mor- 
phisms, C. identities, D. compositions) conforming to some laws (1. identity law, 2. 
associativity law), as follows: 

A. a collection Ob(C), elements of which are called objects; 

B. for every pair x^y E Ob(C), a set Homc(x,7/) e Set. It is called the hom-set 
from X to y; its elements are called morphisms from x to y; 

C. for every object x E Ob(C), a specified morphism denoted id^^ g Homc(x,x) 
called the identity morphism on x; and 

D. for every three objects x^y^z E Ob(C) a function 

o: Home (y, 2:) x Homc(x,?/) ^Homc(x,z), 
called the composition formula. 
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Given objects x^y e Ob(C), we can denote a morphism / g Homc(x, y) hy f : x ^ y; we 
say that x is the domain of / and that y is the codomain of /. Given also g: y ^ 
the composition formula is written using infix notation, so g o f : x ^ z means o(^, /) e 
Homc(x, z). 

Here are the laws that harmonize the above constituents: 

1. for every x^y e Ob(C) and every morphism f : x ^ y^ we have 

/ o ida^ = / and id^ o / = /; 

and; 

2. if w^x^y^z G Ob(C) are any objects and f:w^x^ g: x ^ y^ and h: y ^ z 
are any morphisms, then the two ways to compose are the same: 

{hog)o f = ho{go f)e Homc(^, z). 

Remark 4.1.1.2. There is perhaps much that is unfamiliar about Definition 4.1.1.1 but 
there is also one thing that is strange about it. The objects Ob(C) of C are said to be 
a "collection" rather than a set. This is because we sometimes want to talk about the 
category of all sets, in which the objects are the sets, and the collection of sets is not a 
set. Modeling this was a sticking point in the foundations of category theory, but it was 
eventually fixed by Grothendieck's notion of expanding universes. Roughly the idea is to 
choose some huge set (with certain properties making it a universe)^ to work entirely 
inside of it when possible, and to call anything in that world n-small (or just small if 
n is clear from context). When we need to look at n itself, we choose an even bigger 
universe and work entirely within it. 

A category in which the collection Ob(C) is a set (or in the above language, a small 
set) is called a small category. From here on out we will not take care of the difference, 
referring to Ob(C) as a set. We do not think this will do any harm to scientists using 
category theory, at least not in the beginning phases of their learning. 

Example 4.1.1.3 (The category Set of sets). Chapter 2 was all about the category of sets, 
denoted Set. The objects are the sets and the morphisms are the functions; we even 
used the current notation, referring to the set of functions X ^ y as Homset(-^7 ^)- 
The composition formula o is given by function composition, and for every set X, the 
identity function \dx : X ^ X serves as the identity morphism for X G Ob(Set). The 
two laws clearly hold, so Set is indeed a category. 

Example 4.1.1.4. [The category Mon of monoids] 

We defined monoids in Definition 3.1.1.1 and monoid homo morphisms in Definition 
3.1.4.1. Every monoid M := (M, 1m, *m) ^ has an identity homomorphism id^ : M 

given by the identity function idM • M M. To compose two monoid homo- 
morphisms f : Ai ^ Ai' and g: M' we compose their underlying functions 

f : M ^ M' and g\ M' M" ^ and check that the result ^ o / is a monoid homomor- 
phism. Indeed, 

g o /(1m) = ^(ImO = ^M" 
9 o f{rni ★m rn2) = g{f{mi) ★m' /(^2)) = 9^ /(^i) 9 o /(^2). 
It is clear that the two laws hold, so Mon is a category. 



■•^Note, we changed the notation for our monoid because the symbol id was overloaded. 



4.1. CATEGORIES AND FUNCTORS 



103 



Exercise 4.1.1.5 (The category Grp of groups). Suppose we set out to define a category 
Grp, having groups as objects and group homomorphisms as morphisms, see Definition 
3.2.1.16. Show (to the level of detail of Example 4.1.1.4) that the rest of the conditions 
for Grp to be a category are satisfied. 

Exercise 4.1.1.6 (The category PrO of preorders). Suppose we set out to define a cate- 
gory PrO, having preorders as objects and preorder homomorphisms as morphisms (see 
Definition 3.4.3.1). Show (to the level of detail of Example 4.1.1.4 that the rest of the 
conditions for PrO to be a category are satisfied. 

Example 4.1.1.7. [The category of preorders with joins] 

Suppose that we are only interested in preorders (X, ^) for which every pair of 
elements has a join. We saw in Exercise 3.4.2.4 that not all preorders have this property. 
However we can create a category C in which every object does have this property. 
To begin we put Ob(C) = {(X, ^) e Ob(PrO) | (X, ^) has ah joins}. But what about 
morphisms? 

One option would be to put in no morphisms (other than identities), and to just 
consider this collection of objects as having no structure other than a set. 

Another option would be to put in exactly the same morphisms as in PrO: for any 
objects a, 6 G Ob(C) we consider a and b as regular old preorders, and put Rome {a, b) : = 
Hompro(<^7 b)' The resulting category of preorders with joins is called the full subcategory 
of PrO spanned by the preorders with joins. 

A third option, and the one perhaps that would jump out to a category theorist, is 
to take the choice about how we define our objects as a clue to how we should define 
our morphisms. Namely, if we are so interested in joins, perhaps we want joins to be 
preserved under morphisms. That is, if / : (X, ^x) (^, ) is a morphism of preorders 
then for any join w = x v x' in X we might want to enforce that f{w) = f{x) v v{x') in 
Y . Thus a third possibility for the morphisms of C would be 

Home (a, b) := {f e Hompro(<^7 ^) I / preserves joins}. 

One can check easily that the identity morphisms preserve joins and that compositions of 
join-preserving morphisms are join-preserving, so this version of homomorphisms makes 
for a well-defined category. 

Exercise 4.1.1.8 (Category FLin of finite linear orders). We have a category PrO of 
preorders, and some of its objects are finite (nonempty) linear orders. Let FLin be 
the full subcategory of PrO spanned by the linear orders. That is, following Definition 
3.4.3.1, given linear orders X, F , every morphism of preorders X ^ Y counts as a 
morphism in FLin: 

HomFLin(X,r) = Hompro(X,l'). 

For n 6 N, let [n] be the linear order defined in Example 3.4.1.6. What are the 
cardinalities of the following sets: 

a. ) HomFLin([0],[3]); 

b. ) HomFLi„([3],[0]); 

c. ) HomFLin([2],[3]); 

d. ) HomFLi„([l],[n])? 

e. ) (Challenge) HomFLin(["i], M)? 
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It turns out that the category FLin of hnear orders is sufficiently rich that much of al- 
gebraic topology (the study of arbitrary spaces, such as Mobius strips and 7-dimensional 
spheres) can be understood in its terms. See Example 4.6.1.6. 

Example 4.1.1.9 (Category of graphs). We defined graphs in Definition 3.3.1.1 and graph 
homomorphisms in Definition 3.3.3.1. To see that these are sufficient to form a category, 
is routine. Since a morphism from Q = (V^ A, src^tgt) to = {V ^ A' ^ srd ^tgt') involves 
two functions f^'- V ^ V and fi \ A ^ A' ^ the identity and composition formulas will 
simply arise from the identity and composition formulas for sets. Associativity will follow 
similarly. The only thing that needs to be checked, really, is that the composition of two 
such things, each satisfying (3.5), will itself satisfy (3.5). Just for completeness, we check 
that now. 

Suppose that / = (/o,/i)' Q Q' and g = (go^gi)' G' G" are graph homomor- 
phisms, where G" = {V" ^ A" ^ src" ^tgt"). Then in each diagram below 
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Y 

V ■ 



fo 



V" 




(4.1) 



tgt" 



the left-hand square commutes because / is a graph homomorphism and the right-hand 
square commutes because is a graph homomorphism. Thus the whole rectangle com- 
mutes, meaning that ^ o / is a graph homomorphism, as desired. 

We denote the category of graphs and graph homomorphisms by Grph. 

Exercise 4.1.1.10. Let F be a vector field on B? . Recall that for two points x^x' g R^, 
any curve C with endpoints x and x' ^ and any parameterization r: [a, 6] C, the line 
integral F{r)-dr returns a real number. It does not depend on r, except its orientation 
(direction). Therefore, if we think of C has having an orientation, say going from x to 
x' ^ then F is a well-defined real number. If C goes from x to x' ^ let's suggestively 
write C : x ^ x' . Define an equivalence relation ~ on the set of oriented curves in B? by 
saying C ^ C \i 

• C and C start at the same point, 

• C and C end at the same point, and 

• lcP = lc'P- 

Suppose we try to make a category Cf as follows. Put Ob(CF) = R^, and for every 
pair of points x^x' e R^, let IIomc^(x,x') = {C: x ^ x^}/ ~, where C: x ^ xMs an 
oriented curve and ^ means "same line integral", as explained above. 

Is there an identity morphism and a composition formula that will make Cf into a 
category? 



4.1.1.11 Isomorphisms 

In any category we have a notion of isomorphism between objects. 

Definition 4.1.1.12. Let C be a category and let X^Y e Ob(C) be objects. An isomor- 
phism f from X to Y is di morphism / : X ^ F in C, such that there exists a morphism 
g: Y ^ X in C such that 



go f = idx and f og = idy . 
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In this case we say that the morphism / is invertible and that g is the inverse of /. We 
may also say that the objects X and Y are isomorphic. 

Example 4.1.1.13. If C = Set is the category of sets, then the above definition coincides 
precisely with the one given in Definition 2.1.2.7. 

Exercise 4.1.1.14. Suppose that G = {V, A, src.tgt) and G' = {V , A' , src' ,tgt') are 
graphs and that / = (fo^fi)'- G G' is a graph homomorphism (as in Definition 
3.3.3.1). 

a. ) If / is an isomorphism in Grph, does this imply that fo'- V ^ V and fi'. A ^ A' 

are isomorphisms in Set? 

b. ) If so, why; and if not, show a counterexample (where / is an isomorphism but either 

/o or /i is not). 



Exercise 4.1.1.15. Suppose that G = (V^ A^ src^tgt) and G' = (V\ A' ^ src' ^tgf) are 
graphs and that / = (fo^fi)'- G G' is a graph homomorphism (as in Definition 
3.3.3.1). 

a. ) li f^: V ^ V and fi: A —> A' are isomorphisms in Set, does this imply that / is 

an isomorphism in Grph? 

b. ) If so, why; and if not, show a counterexample (where /o and /i are isomorphisms 

but / is not). 



Lemma 4.1.1.16. Let C he a category and let ~ he the relation on Ob(C) given hy saying 
X ^ Y iff X and Y are isomorphic. Then ~ is an equivalence relation. 

Proof. The proof of Lemma 2.1.2.11 can be mimicked in this more general setting. 

□ 



4.1.1.17 A more formal viewpoint on categories 

Here is an alternate definition of category, using the work we did in Chapter 2. 

Exercise 4.1.1.18. Suppose we begin our definition of category as follows. 
A category, C consists of a sequence (Ob(C), Hom^, (iom, ccxi, id, o), where 

1. Ob(C) is a set, 

2. Home is a set, and dam, cod: Hom^ Ob(C) are functions, 

3. id: Ob(C) Home is a function, and 
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4. o is a function as depicted in the commutative diagram below 




Ob(C) 

a. ) Express the fact that for any x e Ob(C) the morphism id^^ points from x to x in 

terms of the functions id, dom, cod. 

b. ) Express the condition that composing a morphism / with an appropriate identity 

morphism yields /. 

c. ) Express the associativity law in these terms (Hint: Proposition 2.5.1.17 may be 

useful) . 



Example 4.1.1.19 (Partial olog for a category). Below is an olog that captures some of 
the essential structures of a category. 



a morphism 



in 




an object of C 

(4.3) 

Missing from (4.3) is the notion of identity morphism (as an arrow from "^an object 
of to "^a morphism in C^) and the associated path equivalences, as well as the identity 
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and associativity laws. All of these can be added to the olog, at the expense of some 
clutter. 

Remark 4.1.1.20. Perhaps it is already clear that category theory is very wrapped up 
in itself. It feels like everything relates to everything, and this feeling may intensify as 
you go on. However, the relationships between different notions are rigorously defined, 
and almost everything presented here can be formalized in a proof system like Coq (the 
most obvious exceptions being things like the readability requirement of ologs and the 
modeling of scientific applications). 

If you feel cognitive vertigo, look to formal definitions as the ground of your under- 
standing. 

4.1.2 Functors 

A category C = (Ob(C), Hom^, dom, cod, id, o), involves a set of objects a set of mor- 
phisms, a notion of domains and codomains, a notion of identity morphisms, and a 
composition formula. For two categories to be comparable, these various components 
should be appropriately comparable. 

Definition 4.1.2.1. Let C and C be categories. A functor F from C to C ^ denoted 
F: C ^ C consists of some constituents (A. on-objects part, B. on-morphisms part) 
conforming to some laws (1. preservation of identities, 2. preservation of composition), 
as follows: 

A. a function Ob(F) : Ob(C) Ob(C'), which we sometimes denote simply by 
F: Ob(C) ^ Ob(C0; and 

B. for every pair of objects c, d g Ob(C), a function 

Homi?(c, (i): Hom^ (c,(i) IIomc/(F(c), F((i)), 
which we sometimes denote simply by F: Homc(c, d) Hom^/ (F(c), F((i)). 
The following laws harmonize the above constituents: 

1. Identities are preserved by F. That is, for any object c e Ob(C), we have 
F(idc) = idi7^(c); and 

2. Composition is preserved by F. That is, for any objects 6, c, d g Ob(C) and 
morphisms g: b ^ c and h: c ^ we have F{h o g) = F{h) o F{g). 

Example 4.1.2.2 (Monoids have underlying sets). Recall from Definition 3.1.1.1 that if 
M = (M, id, *) is a monoid, then M is a set. And recall from Definition 3.1.4.1 that if 
f : A4 ^ M.' is a monoid homomorphism then f : M ^ M' is a function. Thus we have 
a functor 

U: Mon Set 

that takes every monoid to its underlying set and every monoid homomorphism to its 
underlying function. 

Given two monoids M = (M, id, ★) and = (M',id^*05 there may be many 
functions from M to that do not arise from monoid homo morphisms. It is often 
useful to speak of such functions. For example, one could assign to every command in 
one video game V a command in another video game V, but this may not work in the 
"monoidy way" when performing a sequence of commands. By being able to speak of 
M as a set, or as as a monoid, and understanding the relationship U between them, 
we can be clear about where we stand at all times in our discussion. 
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Example 4.1.2.3. [Groups have underlying monoids] 

Recall that a group is just a monoid (M, id, ★) with the extra property that every 
element m e M has an inverse m'-km = id = rn-km' . Thus to every group we can assign its 
underlying monoid. Similarly, a group homomorphism is just a monoid homomorphism 
of its underlying monoids. This means that there is a functor 

U\ Grp Mon 

that sends every group or group homomorphism to its underlying monoid or monoid 
homomorphism. That identity and composition are preserved is obvious. 

Application 4.1.2.4. Suppose you're a scientist working with symmetries. But then sup- 
pose that the symmetry breaks somewhere, or you add some extra observable which is 
not reversible under the symmetry. You want to seamlessly relax the requirement that 
every action be reversible without changing anything else. You want to know where you 
can go, or what's allowed. The answer is to simply pass from the category of groups (or 
group actions) to the category of monoids (or monoid actions). 

00 

Exercise 4.1.2.5 (Forgetful functors between types of orders). A partial order is just a 
preorder with a special property. A linear order is just a partial order with a special 
property. 

a. ) Is there an "obvious" functor FLin PrO? 

b. ) Is there an "obvious" functor PrO FLin? 



Proposition 4.1.2.6. [Preorders to graphs] 

Let PrO he the category of preorders and Grph he the category of graphs. There is 
a functor U : PrO Grph such that for any preorder X = (X, ^)^ the graph U{X) has 
vertices X . 

Proof. Given a preorder X = (X, ^x), we can make a graph F{X) with vertices X 
and an arrow x ^ x' whenever x ^^ as in Remark 3.4.1.9. More precisely, the 
preorder is a relation, i.e. a subset Rx ^ X x X., which we think of as a function 
i: Rx X X X. Composing with projections tti, 7r2 : X x X ^ X gives us 

srcx := TTi o i: Rx X and tgtx := 7T2 o i: Rx X. 

Then we put F{X) := {X^ Rx^ srcx^tgtx)- This gives us a function F : Ob(PrO) 
Ob(Grph). 

Suppose now that /: A' ^ 3^ is a preorder morphism (where y = (F, ^y)). This is a 
function f.X^Y such that for any (x, x^) e X x X, if x ^' then f{x) ^ /(x'). But 
that's the same as saying that there exists a dotted arrow making the following diagram 
of sets commute 

Rx ^ X X X 

/X/ 

Ry ^ Y xY 



(Note that there cannot be two different dotted arrows making that diagram commute 
because Ry ^ F x F is a monomorphism.) Our commutative square is precisely what's 
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needed for a graph homomorphism, as shown in Exercise 3.3.3.6. Thus, we have defined 
F on objects and on morphisms. It is clear that F preserves identity and composition. 

□ 

Exercise 4.1.2.7. In Proposition 4.1.2.6 we gave a functor U : PrO Grph. 

a. ) Is every graph G e Ob (Grph) in the image of U (or more precisely, is the function 

Oh{U): Ob(PrO) ^ Ob(Grph) 

surjective)? 

b. ) If so, why; if not, name a graph not in the image. 

c. ) Suppose that G^H e Ob(Grph) are two graphs that are in the image of U. Is every 

graph homomorphism /: G ^ H in the image of Homf/? In other words, does every 
graph homomorphism between G and H come from a preorder homomorphism? 



Remark 4.1.2.8. There is a functor W : PrO Set sending (X, ^) to X. There is 
a functor T: Grph Set sending (V^ A^ src^tgt) to V. When we understand the 
category of categories (Section 4.1.2.24), it will be clear that Proposition 4.1.2.6 can be 
summarized as a commutative triangle in Cat, 



PrO ^ Grph 




Set 



Exercise 4.1.2.9. [Graphs to preorders] Recall from (2.3) that every function f : A ^ 
B has an image, mif[A) ^ B. Use this idea and Example 3.4.1.13 to construct a 
functor Im: Grph PrO such that for any graph G = (V^ A^ src^tgt)^ the preorder 
has elements given by the vertices of G (i.e. we have Im{G) = (F, ^g), for some ordering 
^g). 

Exercise 4.1.2.10. What is the preorder Im{G) when G e Ob (Grph) is the following 
graph? 



9 

V f w ^ X 

• ^ • • 






Exercise 4.1.2.11. Consider the functor Im\ Grph PrO constructed in Exercise 
4.1.2.9. 

a.) Is every preorder X E Ob (PrO) in the image of Im (or more precisely in the image 
of Ob(/m) : Ob(Grph) ^ Ob(PrO))? 
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b. ) If so, why; if not, name a preorder not in the image. 

c. ) Suppose that X^y e Ob(PrO) are two preorders that are in the image of Im. Is 

every preorder morphism /: X ^ y in the image of Hom/^? In other words, does 
every preorder homomorphism between X and y come from a graph homomorphism? 



Exercise 4.1.2.12. We have functors U : PrO Grph and /m: Grph PrO. 

a. ) What can you say about Im o U : PrO PrO? 

b. ) What can you say about U o Im: Grph Grph? 



Remark 4.1.2.13. In the course announcement for 18-S996, I wrote the following: 

It is often useful to focus ones study by viewing an individual thing, or a 
group of things, as though it exists in isolation. However, the ability to 
rigorously change our point of view, seeing our object of study in a different 
context, often yields unexpected insights. Moreover this ability to change 
perspective is indispensable for effectively communicating with and learning 
from others. It is the relationships between things, rather than the things 
in and by themselves, that are responsible for generating the rich variety 
of phenomena we observe in the physical, informational, and mathematical 
worlds. 

This holds at many different levels. For example, one can study a group (in the sense of 
Definition 3.2.1.1) in isolation, trying to understand its subgroups or its automorphisms, 
and this is mathematically interesting. But one can also view it as a quotient of something 
else, or as a subgroup of something else. One can view the group as a monoid and look 
at monoid homomorphisms to or from it. One can look at the group in the context of 
symmetries by seeing how the it acts on sets. These changes of viewpoint are all clearly 
and formally expressible within category theory. We know how the different changes of 
viewpoint compose and how they fit together in a larger context. 

Exercise 4.1.2.14. a.) Is the above quote also true in your scientific discipline of exper- 
tise? How so? 

b. ) Can you imagine a way that category theory can help catalogue the kinds of rela- 

tionships or changes of viewpoint that exist in your discipline? 

c. ) What kinds of structures that you use often really deserve to be better formalized? 

Keep this kind of question in mind for your final project. 

Example 4.1.2.15 (Free monoids). Let G be a set. We saw in 3.1.1.12 that List(G) is a 
monoid, called the free monoid on G. Given a function f : G ^ G' ^ there is an induced 
function List(/): List(G) List(G'), and this preserves the identity element [ ] and 
concatenation of lists, so List(/) is a monoid homomorphism. It is easy to check that 
List : Set Mon is a functor. 

Application 4.1.2.16. In Application 2.1.2.9 we discussed an isomorphism Nucdna = 
Nucrna given by RNA transcription. Applying the functor List we get a function 

List(NucDNA) ^ List(NucRNA), 
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which will send sequences of DNA nucleotides to sequences of RNA nucleotides and vice 
versa. This is performed by polymerases. 

00 

Exercise 4.1.2.17. Let G = {1, 2, 3, 4, 5}, = {a,6,c}, and let /: G ^ be given by 
the sequence (a, c, 6, a, c).^ Then if L = [1, 1, 3, 5, 4, 5, 3, 2, 4, 1], what is List(/)(L)? 

Exercise 4.1.2.18. We can rephrase our notion of functor in terms compatible with Ex- 
ercise 4.1.1.18. We would begin by saying that a functor F: C ^ C consists of two 
functions, 

Ob(F) : Ob(C) ^ Oh{C') and Hom^ : Home ^ Homes 

which we call the on-objects part and the on-morphisms part^ respectively. They must 
follow some rules, expressed by the commutativity of the following squares in Set: 



Home Ob(C) 



Home Ob(C) 



(4.4) 



HorQi; 



Ob(F) 



Horn p 



Ob(F) 



Home 



dom 



Ob(co 



Hom^/ 



cod 



ob(co 



Ob(C) - 

Ob(F) 

Ob(C0 



id 



■ Home 

HorQi; 

Home' 



Home xob(C) Home - 
Home/ xob(co Hom^/ 



- Home 

Horn f 

Y 

■ Home' 



(4.5) 



Where does the (unlabeled) left-hand function in the bottom right diagram come from? 
Hint: use Exercise 2.5.1.19. 

Consider Diagram (4.2) and imagine it as though contained in a pane of glass. Then 
imagine a parallel pane of glass involving C in place of C everywhere. 

a. ) Draw arrows from the C pane to the C pane, each labeled Oh{F) or Hom^? as seems 

appropriate. 

b. ) If F is a functor (i.e. satisfies (4.4) and (4.5), do all the squares in your drawing 

commute? 

c. ) Does the definition of functor involve anything not captured in this setup? 



Example 4.1.2.19. [paths-graph] 

Let G = (F, A, src, tgt) be a graph. Then for any pair of vertices v^w e G^ there is a 
set PathG('^, w) of paths from v to w; see Definition 3.3.2.1. In fact there is a set Path^ 
and functions Wc^tgt: Path^ V. That information is enough to define a new graph, 

Paths(G) := (F, Pat he, src, t^). 

Moreover, given a graph homomorphism /: G ^ G' ^ every path in G is sent under / 
to a path in G' . So Paths: Grph Grph is a functor. 



See Exercise 2.1.2.14 in case there is any confusion with this. 
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Exercise 4.1.2.20. a.) Consider the graph G from Example 3.3.3.3. Draw the paths- 
graph Paths(G) for G. 

b. ) Repeating the above exercise for G' from the same example would be hard, because 

the path graph Paths(G') has infinitely many arrows. However, the graph homomor- 
phism f : G ^ G' does induce a morphism of paths-graphs Paths(/) : Paths(G) 
Paths(G'), and it is possible to say how that acts on the vertices and arrows of 
Paths(G). Please do so. 

c. ) Given a graph homomorphism /: G ^ G' and two paths p: v ^ w and q: w ^ x in 

G, is it true that Paths (/) preserves the concatenation? What does that even mean? 



Exercise 4.1.2.21. Suppose that C and V are categories, c, g Ob(C) are objects, and 
F : C ^ V is 8i functor. Suppose that c and c' are isomorphic in C. Show that this 
implies that F{c) and F{c') are isomorphic in V. 

Example 4.1.2.22. For any graph G, we can assign its set of loops Eq{G) as in Exercise 
3.3.1.10. This assignment is functorial in that given a graph homomorphism G ^ G' 
there is an induced function Eq{G) Eq{G'). Similarly, we can functorially assign the 
set of connected components of the graph, CoeqiG). In other words Eq: Grph Set 
and Coeq: Grph Set are functors. The assignment of vertex set and arrow set are 
two more functors Grph Set. 

Suppose you want to decide whether two graphs G and G' are isomorphic. Supposing 
that the graphs have thousands of vertices and thousands of arrows, this could take a 
long time. However, the functors above, in combination with Exercise 4.1.2.21 give us 
some things to try. 

The first thing to do is to count the number of loops of each, because these numbers 
are generally small. If the number of loops in G is different than the number of loops 
in G' then because functors preserve isomorphisms, G and G' cannot be isomorphic. 
Similarly one can count the number of connected components, again generally a small 
number; if the number of components in G is different than the number of components 
in G' then G ^ G'. Similarly, one can simply count the number of vertices or the number 
of arrows in G and G^ These are all isomorphism invariants. 

All this is a bit like trying to decide if a number is prime by checking if it's even, if 
its digits add up to a multiple of 3, or it ends in a 5; these tests do not determine the 
answer, but they offer some level of discernment. 

Remark 4.1.2.23. In the introduction I said that functors allow ideas in one domain to 
be rigorously imported to another. Example 4.1.2.22 is a first taste. Because functors 
preserve isomorphisms, we can tell graphs apart by looking at them in a simpler category. 
Set. There is relatively simple theorem in Set that says that for different natural 
numbers m, n the sets m and n are never isomorphic. This theorem is transported via 
our four functors to four different theorems about telling graphs apart. 

4.1.2.24 The category of categories 

Recall from Remark 4.1.1.2 that a small category C is one in which Ob(C) is a set. We 
have not really been paying attention to this issue, and everything we have said so far 
works whether C is small or not. In the following definition we really ought to be a little 
more careful, so we are. 
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Proposition 4.1.2.25. There exists a category, called the category of small categories 
and denoted Cat^ in which the objects are the small categories and the morphisms are 
the functors, 

Homcat(C,P) = {F: C \ F is 8i functor}. 

That is, there are identity functors, functors can be composed, and the identity and 
associativity laws hold. 

Proof. We follow Definition 4.1.1.1. We have specified Ob (Cat) and Homcat already. 
Given a small category C, there is an identity functor idc : C ^ C that is identity on the 
set of objects and the set of morphisms. And given a functor F: C ^ V and a functor 
G: P ^ f , it is easy to check that G o F: C ^ defined by composition of functions 
Ob(G) o Ob(F): Ob(C) ^ Oh{£) and Hom^ o Hom^: Home ^ Hom£: (see Exercise 
4.1.2.18), is a functor. For the same reasons, it is easy to show that functors obey the 
identity law and the composition formula. Therefore this specification of Cat satisfies 
the definition of being a category. 

□ 

Example 4.1.2.26. [Categories have underlying graphs] 

Let C = ( Ob (C), Home, (iom, cod, id, o) be a category (see Exercise 4.1.1.18). Then 
( Ob (C), Home, dom, cod) is a graph, which we will call the graph underlying C and denote 
by U{C) G Ob(Grph). A functor F : C ^ V induces a graph morphism U{F): U{C) 
U{V)^ as seen in (4.4). So we have a functor, 

[/: Cat ^ Grph. 

Example A. 1.2. 21 . [Free category on a graph] 

In Example 4.1.2.19, we discussed a functor Paths: Grph Grph that considered 
all the paths in a graph G as the arrows of a new graph Paths(G). In fact, Paths(G) 
could be construed as a category, which we will denote F{G) e Ob (Cat) and call the 
free category generated by G. Indeed for any vertex v the trivial path at v serves as the 
identity element, and paths can be concatenated to give a composition. To see that F 
is a functor, we need to see that a graph homomorphism /: G ^ G' induces a functor 
F{f) : F{G) F{G'). But this was shown in Exercise 4.1.2.20. Thus we have a functor 

F: Grph ^ Cat 

called the free category functor. 
Exercise 4.1.2.28. Let G be the graph depicted 

vo e 
• ^•^ 

and let [1] g Ob (Cat) denote the free category on G (see Example 4.1.2.27). We call [1] 

the free arrow category. 

a. ) What are its objects? 

b. ) For every pair of objects in [1], write down the hom-set. 



Exercise 4.1.2.29. Let G be the graph whose vertices are all cities in the US and whose 
arrows are airplane fiights connecting cities. What idea is captured by the free category 
on G? 
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Exercise 4.1.2.30. Let F : Grph Cat denote the free category functor from Example 
4.1.2.27, and let U : Cat Grph denote the underlying graph functor from Example 
4.1.2.27. We have seen the composition U o F: Grph Grph before; what was it 
called? 

Exercise 4.1.2.31. Recall the graph G from Example 3.3.1.2. Let C = F{G) be the free 
category on G. 

a. ) What is Homc('u,x)? 

b. ) What is Homc(x, 'y)? 



Example 4.1.2.32. [Discrete graphs, discrete categories] 

There is a functor Disc. Set Grph that sends a set S to the graph 

Disc{S):={S,0,\,\), 

where ! : ^ 6* is the unique function. We call Disc{S) the discrete graph on the set S. 
It is clear that a function S ^ induces a morphism of discrete graphs. Now applying 
the free category functor F: Grph Cat, we get the so-called discrete category on the 
set which we also might call Disc: Set Cat. 

Exercise 4.1.2.33. Recall the definition of the set n for any natural number n g M, and 
let Dn := Disc{n) e Ob(Cat). 

a. ) List all the morphisms in D4. 

b. ) List all the functors Ds D2. 



Exercise 4.1.2.34 (Terminal category). Let C be a category. How many functors are there 
C ^ Di^ where Di := Disc{l) is the discrete category on one element? 

We sometimes refer to Disc{l) as the terminal category (for reasons that will be made 
clear in Section 4.5.3), and for simplicity denote it by 1. 

Exercise 4.1.2.35. If someone said "Ob is a functor from Cat to Set," what might they 
mean? 



4.2 More basic examples from mathematics 
4.2.1 Monoids, groups, preorders, and graphs 

We saw in Section 4.1.1 that there is a category Mon of monoids, a category Grp of 
groups, a category PrO of preorders, and a category Grph of graphs. In this section we 
show that each monoid each group 5, and each preorder V can be considered as its 
own category. If each object in Mon is a category, we might hope that each morphism 
in Mon is just a functor, and this is true. The same holds for Grp and PrO. We wih 
deal with graphs in Section 4.2.1.19. 

4.2.1.1 Monoids as categories 

In Example 3.1.2.9 we said that to olog a monoid, we should use only one box. And 
again in Example 3.5.3.3 we said that a monoid action could be captured by only one 
table. These ideas emanated from the understanding that a monoid is perfectly modeled 
as a category with one object. 
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Each monoid as a category with one object Let (M, id, ★) be a monoid. We 
consider it as a category M. with one object, Ob(A^) = {a}, and 

HomA4(A, a) := M. 

The identity morphism id^ serves as the monoid identity id, and the composition formula 

o : Hom;\^ (a, a) X Hom;\^ i^i^) ~^ Hom;\^ (a, a) 

is given by ★ : M x M ^ M. The associativity and identity laws for the monoid match 
precisely with the associativity and identity laws for categories. 

If monoids are categories with one object, is there any categorical way of phrasing 
the notion of monoid homomorphism? Suppose that M = (M, idM,*M) and = 
{M\ id\ We know that a monoid homomorphism is a function f : M ^ M' such that 
/(id) = id' and such that for every pair mo, mi e M we have /(mo*mi) = /(mo)*' /(mi). 
What is a functor M ^ 

Each monoid homomorphism as a functor between one-object categories Say 

that Ob(A^) = {a} and Ob(A^') = {a'}; and we know that Hom;^(A,A) = M and 
HomA4/(A^A') = M'. A functor F\ M ^ M' consists first of a function Oh{M) 
Ob(7W), but these sets have only one element each, so there is nothing to say on that 
front. It also consists of a function Hom^ hom;\^/ but that is just a function M M' . 
The identity and composition formulas for functors match precisely with the identity and 
composition formula for monoid homomorphisms, as discussed above. Thus a monoid 
homomorphism is nothing more than a functor between one-object categories. 

SloganA.2.1.2. 

" A monoid is a category Q with one object. A monoid homomorphism is 
just a functor between one-object categories. " 

We formalize this as the following theorem. 
Theorem 4.2.1.3. There is a functor i: Mon Cat with the following properties: 

• for every monoid Ai e Ob(Mon)^ the category i{M.) e Ob(Cat) itself has exactly 
one object, 

\Oh{i{M))\ = 1 

• for every pair of monoids A^, Ai' e Ob (Mon) the function 

HomMon(A4, A^O ^ Y{omc^t(i{M),i{M')), 

induced by the functor i, is a bijection. 

Proof. This is basically the content of the preceding paragraphs. The functor i sends a 
monoid to the corresponding category with one object and i sends a monoid homomor- 
phism to the corresponding functor; it is not hard to check that i preserves identities 
and compositions. 

□ 

Theorem 4.2.1.3 situates the theory of monoids very nicely within the world of cate- 
gories. But we have other ways of thinking about monoids, namely their actions on sets. 
As such it would greatly strengthen the story if we could subsume monoid actions within 
category theory also, and we can. 
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Each monoid action as a set- valued functor Recall from Definition 3.1.2.1 that if 
(M, id, *) is a monoid, an action consists of a set S and a function G: M x S ^ S such 
that id Q 5 = 5 and mo G (mi G s) = (mo ★ mi) Q 5 for all s e S. How might we 
relate the notion of monoid actions to the notion of functors? One idea is to try asking 
what a functor F: Ai ^ Set is; this idea will work. 

Since M. has only one object, we obtain one set, S := F{a) e Ob(Set). We also 
obtain a function Hom^?: Hom;v4(A,A) Homset (^(^), ^(^)), or more concisely, a 
function 

Hp: M ^Roms^t{S,S). 

By currying (see Proposition 2.7.2.3), this is the same as a function O : M x S ^ S. The 
rule that id Q s = 5 becomes the rule that functors preserve identities, Homi7(idA) = id^. 
The other rule is equivalent to the composition formula for functors. 

4.2.1.4 Groups as categories 

A group is just a monoid (M, id, *) in which every element m e M is invertible, meaning 
there exists some m' e M with m ^ m! = id = m' * m. If a monoid is the same thing 
as a category Ai with one object, then a group must be a category with one object 
and with an additional property having to do with invertibility. The elements of M are 
the morphisms of the category A^, so we need a notion of invertibility for morphisms. 
Luckily we have such a notion already, namely isomorphism. We have the following: 

Slogan 1. b. 

" A group is a category Q with one object, such that every morphism in Q 
is an isomorphism. A group homomorphism is just a functor between such 
categories. " 

Theorem 4.2.1.6. There is a functor i: Grp Cat with the following properties: 

• for every group Q e Ob(Grp)^ the category i{Q) e Ob(Cat) itself has exactly one 
object, and every morphism m in i{Q) is an isomorphism; and 

• for every pair of groups G^Q' e Ob (Grp) the function 

HomGrp(S,SO ^ Homcat(^(5),^(S0)^ 

induced by the functor i, is a bijection. 

Just as with monoids, an action of some group (G, id, ★) on a set 5 g Ob (Set) is the 
same thing as a functor Q Set sending the unique object of Q to the set S. 

4.2.1.7 Monoids and groups stationed at each object in a category 

If a monoid is just a category with one object, we can locate monoids in any category C 
by narrowing our gaze to one object in C Similarly for groups. 

Example 4.2.1.8 (Endomorphism monoid). Let C be a category and x e Ob(C) an object. 
Let M = Homc(x,x). Note that for any two elements f^g e M we have f o g: x ^ x 
in M. Let Ai = (M, id^^,©). It is easy to check that is a monoid; it is called the 
endomorphism monoid of x in C. 
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Example 4.2.1.9 (Automorphism group). Let C be a category and x g Ob(C) an object. 
Let G = {f : X ^ X \ f is dLii isomorphism}. Let Q = (G, id^^, o). It is easy to check that 
^ is a group; it is cahed the automorphism group of x in C. 

Exercise 4.2.1.10. Let S = {1,2,3,4} g Ob(Set). 

a. ) What is the automorphism group of S in Set, and how many elements does this 

group have? 

b. ) What is the endomorphism monoid of S in Set, and how many elements does this 

monoid have? 

c. ) Recall from Example 4.1.2.3 that every group has an underlying monoid U{G); is 

the endomorphism monoid of S the underlying monoid of the automorphism group 
of S7 



Exercise 4.2.1.11. Consider the graph G depicted below. 
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What is its group of automorphisms? Hint: every automorphism of G will induce an 
automorphism of the set {1, 2, 3, 4}; which ones will preserve the arrows? 

4.2.1.12 Preorders as categories 

A preorder (X, ^) consists of a set X and a binary relation ^ that is reflexive and 
transitive. We can make from (X, ^) g Ob(PrO) a category A' g Ob(Cat) as follows. 
Define Ob (A') = X and for every two objects x^y e X define 



{"x ^ ?/"} if X ^ y 
if X ^y 



To clarify: if x ^ y^ we assign }iomx{x^y) to be the set containing only one element, 
namely the string "x ^ y^\^ If {x^y) is not in relation ^, then we assign Hom;f(x,?/) to 
be the empty set. The composition formula 

o: Hom;f(x,?/) X Hom;t'(^,^) ^ Hom;f(x,z) (4.6) 

is completely determined because either one of two possibilities occurs. One possibility 
is that the left-hand side is empty (if either x ^ y or y ^ z; in this case there is a unique 
function o as in (4.6). The other possibility is that the left-hand side is not empty in 
case X ^ y and y ^, which implies x ^ so the right-hand side has exactly one element 
^ in which case again there is a unique function o as in (4.6). 



^The name of this morphism is completely unimportant. What matters is that }ioinLi;^(x,y) has 
exactly one element x ^ y. 
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On the other hand, if C is a category having the property that for every pair of objects 
x^y E Ob(C), the set Homc(x,?/) is either empty or has one element, then we can form 
a preorder out of C. Namely, take X = Ob(C) and say x ^ y if there exists a morphism 
X ^ y in C. 

Exercise 4.2.1.13. We have seen that a preorder can be considered as a category V. Recall 
from Definition 3.4.1.1 that a partial order is a preorder with an additional property. 
Phrase the defining property for preorders in terms of isomorphisms in the category V. 



Exercise 4.2.1.14. Suppose that C is a preorder (considered as a category). Let x^y e 
Ob(C) be objects such that x ^ y and y ^ x. Prove that there is an isomorphism x ^ y 
in C. 

Example 4.2.1.15. The olog from Example 3.4.1.3 depicted a partial order, say V. In it 
we have 

Hom-pC^a diamond^, "^a red card^) = {is} 

and we have 

Hom-pC^a black queen\ "^a card^) = {is o is}; 

Both of these sets contain exactly one element, the name is not important. The set 
Hom-pC^a 4\ "^a 4 of diamonds^) = 0. 

Exercise 4.2.1.16. Every linear order is a partial order with a special property. Can you 

phrase this property in terms of hom-sets? 

Proposition 4.2.1.17. There is a functor i: PrO Cat with the following properties 
for every preorder (X, ^).' 

1. the category X := ^) has objects Ob(A') = X ; and 

2. for each pair of elements x^x' e X the set Hom;t'(^, x') has at most one element. 
Moreover, any category with property 2 is in the image of i. 

Proof. We have already shown how i works on objects. To see that it is a functor we 
must show that for any morphism /: (X, ^x) (Y^ ^y) of preorders, there is a functor 
X ^ where y := (F, ^y). This follows directly from definitions. The second 
claim, that any category with property 2 is in the image of i follows from the discussion 
above. 

□ 

Exercise 4.2.1.18. Recall the functor U : PrO Grph from Proposition 4.1.2.6, the 
functor F: Grph Cat from Example 4.1.2.27, and the functor i: PrO Cat from 
Proposition 4.2.1.17. 

a.) Does the following diagram of categories commute? 
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b.) We also had a functor Grph PrO. Does the fohowing diagram of categories 
commute? 

Grph ^ PrO 




Cat 



4.2.1.19 Graphs as functors 

Let C denote the category depicted below 



C :-- 



tgt 



(4.7) 



Then a functor G: C ^ Set is the same thing as two sets G(A), G{V) and two functions 
G{src): G{A) G{V) and G{tgt): G{A) G{V). This is precisely what is needed for 
a graph; see Definition 3.3.1.1. We call C the graph indexing category. 

Exercise 4.2.1.20. Consider the terminal category, 1, also known as the discrete category 
on one element (see Exercise 4.1.2.34). Let C be as in (4.7) and consider the functor 
zq: 1 ^ C sending the object of 1 to the object V e Ob(C). If G: C ^ Set is a graph, 
what is the composite G o zq? It consists of only one set; what set is it? For example, 
what set is it when G is the graph from Example 3.3.3.3. 

If a graph is a functor C Set, what is a graph homomorphism? We will see later 
in Example 4.3.1.12 that graph homomorphisms are homomorphisms between functors, 
which are called natural transformations. (Natural transformations are the highest- 
" level" structure that occurs in ordinary category theory.) 

Example 4.2.1.21. Let V be the category whose underlying graph is depicted below 



■Q' 



tgt 



(4.8) 



with the following composition formula: 



pop = idA'', srcop = tgt] and tgt o p = src. 

This category V is the symmetric graph indexing category. Just like any graph can 
be understood as a functor C Set, where C is the graph indexing category displayed 
in (4.7), any symmetric graph can be understood as a functor V Set, where V is the 
category drawn above. 

It is customary to draw the connections in a symmetric graphs as line segments 
rather than arrows between vertices. However, a better heuristic is to think that these 
connections consists of two arrows, one pointing in each direction. More precisely, every 
arrow has an equal and opposite arrow. 

Exercise 4.2.1.22. Which of the following graphs are symmetric: 



a.) 
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b. ) The graph G from (3.3)? 

c. ) The graph G from Exercise 3.3.1.8? 

d. ) The graph from (3.6)? 

e. ) The graph Coop from (3.15), i.e. the graph having exactly one vertex and one arrow? 

f. ) The graph G from Exercise 4.2.1.11? 



Exercise 4.2.1.23. Let C be the graph indexing category shown in (4.7) and let V be the 
symmetric graph indexing category displayed in (4.8). 

a. ) How many functors are there of the form C VI 

b. ) Is one more "reasonable" than the others? 

c. ) Choose the one that seems most reasonable and call Hi: C ^ V. If a symmetric graph 

is a functor S : V ^ Set, you can compose with i to get a functor S o i: C Set. 
This is a graph; what graph is it? What has changed? 



4.2.2 Database schemas present categories 

Recall from Definition 3.5.2.6 that a database schema (or schema, for short) consists of a 
graph together with a certain kind of equivalence relation on its paths. In Section 4.4.1 
we will define a category Sch that has schemas as objects and appropriately modified 
graph homomorphisms as morphisms. In Section 4.4.2 we prove that the category of 
schemas is equivalent (in the sense of Definition 4.3.4.1) to the category of categories, 

Sch - Cat. 

The difference between schemas and categories is like the difference between monoid 
presentations, given by generators and relations as in Definition 3.1.1.14, and the monoids 
themselves. The same monoid has (infinitely) many different presentations, and so it is 
for categories: many different schemas can present the same category. A schema is a 
compact form, and can be specified in finite space and time while generating something 
infinite. 

Slogan 4.2.2.1. 

" A database schema is a category presentation. " 

Example 4.2.2.2. The schema Coop^ depicted below, has no path equivalence declarations. 
As a graph it has one vertex and one arrow. 



Coop := 



The category it generates, however, is the free monoid on one generator, N. It has one 
object ▲ but a morphism /"^ : A ^ A for every natural number n G M, thought of as 
"how many times to go around the loop Clearly, the schema is more compact that 
the infinite category it generates. 
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Exercise 4.2.2.3. Consider the olog from Exercise 3.5.2.18, which says that for any father 
X, his first child's father is x. It is redrawn below as a schema <S, and we include the 
desired path equivalence declaration, F c f = 




How many morphisms are there (total) in the category generated by 5? 

Exercise 4.2.2.4. Suppose that G is a graph and that Q is the schema generated by G 
with no PEDs. What is the relationship between the category generated by Q and the 
free category F{G) e Ob(Cat) as defined in Example 4.1.2.27? 



4.2.2.5 Instances on a schema C 

If schemas are like categories, what are instances? Recall that an instance / on a schema 
S = (G, ^) assigns to each vertex v in G 8i set of rows say I{v) e Ob(Set). And to every 
arrow a: v ^ in G the instance assigns a function I (a) : I{v) I{v^)- The rule is that 
given two equivalent paths, their compositions must give the same function. Concisely, 
an instance is a functor I: S ^ Set. 

Example 4.2.2.6. We have now seen that a monoid is just a category with one ob- 
ject and that a monoid action is a functor Ai Set. Under our understanding of 
database schemas as categories, is a schema and so an action becomes an instance 
of that schema. The monoid action table from Example ex: action table was simply a 
manifestation of the database instance according to the Rules 3.5.2.8. 

Exercise 4.2.2.7. In Section 4.2.1.19 we discuss how each graph is a functor Q Set for 
the graph indexing category depicted below: 



A src y 



tgt 



But now we know that if a graph is a set-valued functor then we can consider ^ as a 
database schema. 

a. ) How many tables, and how many columns of each should there be (if unsure, consult 

Rules 3.5.2.8)? 

b. ) Write out the table view of graph G from Example 3.3.3.3. 



4.2.3 Spaces 

Category theory was invented for use in algebraic topology, and in particular to discuss 
natural transformations between certain functors. We will get to natural transformations 
more formally in Section 4.3. For now, they are ways of relating functors. In the original 
use, Eilenberg and Mac Lane were interested in functors that connect topological spaces 
(shapes like spheres, etc.) to algebraic systems (groups, etc.) 

For example, there is a functor that assigns to each space X its group 7ri(X) of round- 
trip voyages (starting and ending at some chosen point x G X), modulo some equivalence 
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relation. There is another functor that assigns to every space its group Hi (X, Z) of ways 
to drop some (positivie or negative) number of circles on X. These two functors are 
related, but they are not equal. 

There is a relationship between the functor tti and the functor Hi. For example 
when X is the figure-8 space (two circles joined at a point) the group 7Ti{X) is much 
bigger than the group Hi{X). Indeed 7Ti{X) includes information about the order and 
direction of loops traveled; whereas the group Z) includes only information about 

how many times one goes around each loop. However, there is a natural transformation 
of functors 7ri(— ) ,Z), called the Hurewicz transformation, which "forgets" the 

extra information and thus yields a simplification. 

Example 4.2.3.1. Given a set X, recall that P(X) denotes the set of subsets of X. A 
topology on X is a choice of which subsets U e P(X) will be called open sets. The union 
of any number of open sets must be considered to be an open set, and the intersection of 
any finite number of open sets must be considered open. One could say succinctly that 
a topology on X is a sub-order Op(X) ^ P(X) that is closed under taking finite meets 
and infinite joins. 

A topological space is a pair (X, Op(X)), where X is a set and Op(X) is a topology 
on X. The elements of the set X are called points. A morphism of topological spaces 
(also called a continuous map) is a function f : X ^ Y such that for every V g Op(F) 
the preimage f~^{V) g P(X) is actually in Op(X). That is, such that there exists a 
dashed arrow making the diagram below commute: 

Op(y) - - ^ Op(X) 
p(r)^-^p(x). 

The category of topological spaces, denoted Top, is the category having objects and 
morphisms as above. 

Exercise 4.2.3.2. a.) Explain how "looking at points" gives a functor Top Set. 
b.) Does "looking at open sets" give a functor Top PrO? 



Example 4.2.3.3. The set R can be given a topology in a standard way."^ But (R, 0, +) 
is also a monoid. Moreover, for every x g R the monoid operation + : R x R ^ R is 
continuous ^ So we say that IZ := (R, 0, +) is a topological monoid. 

Recall from Section 4.2.1.1 that a monoid action is a functor A4 Set, where Ai 
is a monoid. Instead imagine a functor a: 1Z ^ Top? Since 7^ is a category with one 
object, this amounts to an object X g Ob(Top), a space. And to every real number 
t G R we obtain a continuous map a{t): X ^ X. If we consider X as the set of states 
of some system and R as the time line, we have captured what is called a continuous 
dynamical system. 

Example 4.2.3.4. Recall (see []) that a real vector space is a set X, elements of which are 
called vectors, which is closed under addition and scalar multiplication. For example R^ 

'^The topology is given by saying that t/ Q R is open iff for every x E U there exists e > such that 
{2/g]R| \y — x\ < e} Q U}. One says, "?7 Q R is open if every point in U has an epsilon-neighborhood 
fully contained in t/". 

^The topology on R x R is similar; a subset U Q R x R is open if every point x E U has an epsilon- 
neighborhood (a disk around x of some positive radius) fully contained in U. 
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is a vector space. A linear transformation from X to Y is 8i function f : X Y that 
appropriately preserves addition and scalar multiplication. The category of real vector 
spaces, denoted Vect]^, has as objects the real vector spaces and as morphisms the linear 
transformations. 

There is a functor Vect]^ Grp sending a vector space to its underlying group of 
vectors, where the group operation is addition of vectors and the group identity is the 
0- vector. 

Exercise 4.2.3.5. Every vector space has vector subspaces, ordered by inclusion (the 
origin is inside of any line which is inside of certain planes, etc., and all are inside of the 
whole space V). If you know about this topic, answer the following questions. 

a. ) Does a linear transformation V ^ V induce a morphism of these orders? In other 

words, is there a functor Vect]^ PrO? 

b. ) Would you guess that there is a nice functor Vect]^ Top? By a "nice functor" I 

mean one that doesn't make people roll their eyes (for example, there is a functor 
Vect]^ Top that sends every vector space to the empty space, and that's not 
really a "nice" one. If someone asked for a functor VectjR Top for their birthday, 
this functor would make them sad. We're looking for a functor VectjR Top that 
would make them happy.) 



4.2.3.6 Groupoids 

Groupoids are like groups except a groupoid can have more than one object. 

Definition 4.2.3.7. A groupoid is a category C such that every morphism is an isomor- 
phism. If C and V are groupoids, a morphism of groupoids, denoted F: C ^ V, is simply 
a functor. The category of groupoids is denoted Grpd. 

Example 4.2.3.8. There is a functor Grpd Cat, sending a groupoid to its underlying 
category. There is also a functor Grp Grpd sending a group to "itself as a groupoid 
with one object." 

Application 4.2.3.9. Let M be a material in some original state sq.^ Construct a category 
Sm whose objects are the states of M, e.g. by pulling on M in different ways, or by 
heating it up, etc. we obtain such states. Include a morphism from state s to state 
s' if there exists a physical transformation from s to s'. Physical transformations can 
be performed one after another, so we can compose morphisms, and perhaps we can 
agree this composition is associative. Note that there exists a morphism is', sq ^ s for 
any s. Note also that this category is a preorder because there either exists a physical 
transformation or there does not. ^ 

The elastic deformation region of the material is the set of states s such that there 
exists a morphism s ^ sq, because any such morphism will be the inverse of is', sq ^ s. 
A transformation is irreversible if there is no transformation back. If si is not in the 
elastic deformation region, we can (inventing a term) still talk about the region that is 

^This example may be a bit crude, in accordance with the crudeness of my understanding of materials 
science. 

^Someone may choose to beef this category up to include the set of physical processes between states 
as the hom-set. This gives a category that is not a preorder. But there would be a functor from their 
category to ours. 



124 



CHAPTER 4. BASIC CATEGORY THEORY 



"elastically-equivalent" to si. It is all the objects in Sm that are isomorphic to si. If we 
consider only elastic equivalences, we are looking at a groupoid sitting inside the larger 
category Sm- 

00 

Example 4.2.3.10. Alan Weinstein explains groupoids in terms of tiling patterns on a 
bathroom floor, see [WeA]. 

Example 4.2.3.11. Let / = {x6R|0^x^l} denote the unit interval. It can be given 
a topology in a standard way, as a subset of R (see Example 4.2.3.3) 

For any space X, a path in X is a continuous map I ^ X. Two paths are called 
homotopic if one can be continuously deformed to the other, where the deformation 
occurs completely within X. ^ One can prove that being homotopic is an equivalence 
relation on paths. 

Paths in X can be composed, one after the other, and the composition is associative 
(up to homotopy). Moreover, for any point x E X there is a trivial path (that stays at 
x). Finally every path is invertible (by traversing it backwards) up to homotopy. 

This all means that to any space X we can associate a groupoid, called the funda- 
mental groupoid of X and denoted ni(X) e Ob(Grpd). The objects of ni(X) are the 
points of X; the morphisms in ni(X) are the paths in X (up to homotopy). A continu- 
ous map f : X ^ Y can be composed with any path / ^ X to give a path I ^ Y and 
this preserves homotopy. So in fact Hi : Top Grpd is a functor. 

Exercise 4.2.3.12. Let T denote the surface of a donut, i.e. a torus. Choose two points 
p^q E T. Since Ili{T) is a groupoid, it is also a category. What would the hom-set 
Homni(T)(P7 ^) represent? 
Exercise 4.2.3.13. Let [/ ^ be an open subset of the plane, and let F be any vector 
field on U. Following Exercise 4.1.1.10, we have a category Cf- Take it on faith (or read 
this to see) that if two curves C, in U are homotopic then they have the same line 
integral, F = ^• 

We also have a category Hit/, given by the fundamental groupoid, as in Example 
4.2.3.11. Both categories have the same objects, Ob(Ci?) = \U\ = Ob(ni/7), the set of 
points in U. 

a. ) Is there a functor Cp ^ ^lU or a functor UiU Cf that is identity on the under- 

lying objects? 

b. ) What is Cf if F is a conservative vector field? 



Exercise 4.2.3.14. Consider the set A of all (well-formed) arithmetic expressions in the 
symbols {0,... ,9,+,— ,*,(,)}. For example, here are some elements of A: 

52, 52 - 7, 50 + 3 * (6 - 2). 

We can say that an equivalence between two arithmetic expressions is a justification that 
they give the same "final answer", e.g. 52 + 60 is equivalent to 10 * (5 + 6) + (2 + 0), which 

^ Let = {{x,y) E \ ^ X ^ 1 and ^ y ^ 1} denote the square. There are two inclusions 
io,ii: I ^ S that put the interval inside the square at the left and right arrows. Two paths /o, /i : I ^ X 
are homotopic if there exists a continuous map /: I x I ^ X such that /o — f o {q and /i = / o ii, 
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is equivalent to 10 * 11 + 2. I've basically described a groupoid. What are its objects and 
what are its morphisms? 

4.2.4 Logic, set theory, and computer science 
4.2.4.1 The category of propositions 

Logical statements are either true or false. They are statements like 

For all real numbers x g R there exists a real number y eR such that y > 3x. 

Knowing that a similar statement with 3 replaced by 4 is true may or may not imply 
the above statement. There is a category Prop whose objects are logical statements 
(propositions) and whose morphisms are proofs that one statement implies another. 
Crudely, one might say that B holds at least as often as A if there is a morphism A^B 
(meaning whenever A holds, so does B). So the proposition "x ^ holds very seldom 
and "x = x" always very often. 

Example 4.2.4.2. We can repeat this idea for non- mathematical statements. Take all 
possible statements that are verifiable by experiment as objects of a category. Given 
two such statements, it may be that one implies the other (e.g. "if the speed of light is 
fixed then there are relativistic effects"). Every statement implies itself (identity) and 
implication is transitive, so we have a category. 

Let's consider differences in proofs to be irrelevant, so the category Prop becomes a 
preorder: either A implies B or it does not. Then it makes sense to discuss meets and 
joins. It turns out that meets are "and's" and joins are "or's". That is, given propositions 
A, B the meet AaB\^ defined to be a proposition that holds as often as possible subject 
to the constraint that it implies both A and B] the proposition "A holds and B holds" 
fits the bill. Similarly, the join Aw B is given by "A holds or B holds". 

Exercise 4.2.4.3. Consider the set of possible laws (most likely an infinite set) that can 
be dictated to hold throughout a jurisdiction. Consider each law as a proposition ("such 
and such is (dictated to be) the case"), i.e as an object of our preorder Prop. Given 
a jurisdiction V, and a set of laws {^1,^2, • • • ,^n} that are dictated to hold throughout 
F, we take their meet L(V) := £1 a -^2 a • • • a -^^ and consider it to be the single law of 
the land V . Suppose that F is a jurisdiction and U ^ F is a sub-jurisdiction (e.g. U is 
a county and V is a state); then clearly any law dictated by the large jurisdiction (the 
state) must also hold throughout the small jurisdiction (the county). 

a. ) What is the relation in Prop between L{U) and L{y)l 

b. ) Consider the obvious preorder J on jurisdictions, where U ^ V means U ^V. Is 

"the law of the land" a morphism of preorders J Prop? To be a bit more high- 
brow, considering both J and Prop to be categories (by Proposition 4.2.1.17), we 
have a function L: Ob(J) Ob(Prop); this question is asking whether L extends 
to a functor J Prop.^ 



Exercise 4.2.4.4. Take again the preorder J of jurisdictions from Exercise 4.2.4.3 and the 
idea that laws are propositions. But this time, let R{V) be the set of all possible laws 



^Hint: Exercises 4.2.4.3 and 4.2.4.4 will ask similar yes/no questions and at least one of these is 
correctly answered "no". 
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(not just those dictated to hold) that are in actuahty being respected, i.e. fohowed, by 
ah people in V. This assigns to each jurisdiction a set. 

a. ) Since preorders can be considered categories, does our "the set of respected laws" 

function R: Ob(J) Ob(Set) extend to a functor J Set? 

b. ) What about if instead we take the meet of all these laws and assign to each ju- 

risdiction the maximal law respected throughout. Does this assignment Ob (J) 
Ob (Prop) extend to a functor J Prop? ^ 



4.2.4.5 A categorical characterization of Set 

The category Set of sets is fundamental in mathematics, but instead of thinking of it 
as something given or somehow special, it can be shown to merely be a category with 
certain properties, each of which can be phrased purely categorically. This was shown 
by Lawvere [Law] . A very readable account is given in [Le2] . 

4.2.4.6 Categories in computer science 

Computer science makes heavy use of trees, graphs, orders, lists, and monoids. We have 
seen that all of these are naturally viewed in the context of category theory, though 
it seems that such facts are rarely mentioned explicitly in computer science textbooks. 
However, categories are also used explicitly in the theory of programming languages 
(PL). Researchers in that field attempt to understand the connection between what 
programs are supposed to do (their denotation) and what they actually cause to occur 
(their operation). Category theory provides a useful mathematical formalism in which 
to study this. 

The kind of category most often considered by a PL researcher is what is known 
as a Cartesian closed category or CCC^ which means a category T that has products 
(like A X B in Set) and exponential objects (like B^ in Set). Set is an example 
of a CCC, but there are others that are more appropriate for actual computation. 
The objects in a PL person's CCC represent the types of the language, types such 
as integers, strings, floats. The morphisms represent computable functions, e.g. 
length: strings — ^integers. The products allow one to discuss pairs (a, 6) where 
a is of one type and b is of another type. Exponential objects allow one to consider 
computable functions as things that can be input to a function (e.g. given any com- 
putable function f loats^integers one can consistently multiply its results by 2 and 
get a new computable function f loats^integers. We will be getting to products in 
Section 4.5.1.8 and exponential objects in Section 4.3.2. 

But category theory did not only offer a language for thinking about programs, it 
offered an unexpected tool called monads. The above CCC model for types allows re- 
searchers only to discuss functions, leading to the notion of functional programming 
languages; however, not all things that a computer does are functions. For example, 
reading input and output, changing internal state, etc. are operations that can be per- 
formed that ruin the functional- ness of programs. Monads were found in 19?? by Moggi 
[Mog] to provide a powerful abstraction that opens the doors to such non-functional 
operations without forcing the developer to leave the category-theoretic garden of eden. 
We will discuss monads in Section 5.3. 
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We have also seen in Section 4.2.2 that databases are well captured by the language of 
categories. We will formalize this in Section 4.4. Throughout the remainder of this book 
we will continue to use databases to bring clarity to concepts within standard category 
theory. 

4.2.5 Miscellaneous 

Categories are being used throughout mathematics to relate various subjects, as well 
as to draw out the essential structures within these subjects. For example, there is an 
active research for "categorifying" classical theories like that of knots, links, and braids 

[]■ 

Quantum field theory is also being categorified, with much success []. In this domain, 
one takes a category in which an object is a reasonable space, called a manifold, and a 
morphism is a manifold connecting two manifolds, like a cylinder connects two circles. 
Such connecting manifolds are called cobordisms, and as such people refer to the category 
as Cob. Topological quantum field theory is the study of functors Cob Vect that 
assign a vector space to each manifold and a linear transformation of vector spaces to 
each cobordism. 

Information theory is the study of how to ideally compress messages so that they 
can be sent quickly and accurately across a noisy channel. Invented in 1948 by Claude 
Shannon, its main quantity of interest is the number of bits necessary to encode a piece 
of information. For example, the amount of information in an English sentence can be 
greatly reduced. The fact that t's are often followed by h's, or that e's are much more 
common than z's, implies that letters are not being used as efficiently as possible. The 
amount of bits necessary to encode a message is called its entropy and has been linked 
to the commonly used notion of the same name in physics. 

In [], Baez and Leinster show that entropy can be captured quite cleanly using cate- 
gory theory. They make a category FinProb whose objects are finite sets equipped with 
a probability measure, and whose morphisms are probability preserving functions. They 
characterize information loss as a way to assign numbers to such morphisms, subject to 
certain explicit constraints. They then show that the entropy of an object in FinProb 
is the amount of information lost under the unique map to the singleton set {©}. This 
approach explicates (by way of the explicit constraints for information loss functions) the 
essential idea of Shannon's information theory, allowing it to be generalized to categories 
other than FinProb. Thus Baez and Leinster effectively categorified information theory. 

4.3 Natural transformations 

In this section we conclude our discussion of the Big 3, by defining natural transforma- 
tions. Category theory was originally invented to discuss natural transformations. These 

-•^^To me, the subject of "information theory" is badly named. That disciphne is devoted to finding 
ideal compression schemes for messages to be sent quickly and accurately across a noisy channel. It 
deliberately does not pay any attention to what the messages mean. To my mind this should be called 
compression theory or redundancy theory. Information is inherently meaningful — that is its purpose — 
any theory that is unconcerned with the meaning is not really studying information per se. The people 
who decide on speed limits for roads and highways may care about human health, but a study limited 
to deciding ideal speed limits should not be called "human health theory". 

■'^■'^ Despite what was said above. Information theory has been extremely important in a diverse array 
of fields, including computer science [], but also in neuroscience [] and physics []. I'm not trying to 
denigrate the field; I am only frustrated with its name. 
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were sufficiently conceptually challenging that they required formalization and thus the 
invention of category theory. If we think of categories as domains (of discourse, interac- 
tion, comparability, etc.) and of functors as transformations between different domains, 
the natural transformations compare different transformations. 

Natural transformations can seem a bit abstruse at first, but hopefully some examples 
and exercises will help. 



4.3.1 Definition and examples 

Definition 4.3.1.1. Let C and V be categories and let F : C ^ V and G: C ^ V he 
functors. A natural transformation a from F to G, denoted a: F G, consists of 
some constituents (A. components) conforming to some laws (1. naturality squares), as 
follows: 

A. For each object c e Ob(C) a morphism ac'. F{c) G{c) in P, called the c- 
component of a. 

The following law harmonizes the above constituents: 

1. For every morphism h: c ^ c' in C, the following square must commute: 



F(c) 

F(h) 



■G{c) 

G(h) 



(4.9) 



F{c')—^G{c') 



To think about a natural transformation between functors F^ G : C ^ V , Si picture to 
have in mind is this: 




r 



Both panes are pictures of C, and we imagine F as a mapping from the left pane 
to V and G as a mapping from the right pane to V. There is a horizontal arrows for 
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each object c e Ob(C) and we imagine these as the components ac of a. To each arrow 
h: c ^ inC we see a commutative square as in (4.9): its left side is F{h)^ its right side 
is G(/i), its top is ac and its bottom is ac'. This picture will be made formal in Example 
4.5.3.22. 

Application 4.3.1.2. In Figure 3.1 we drew a finite state machine on alphabet S = {a, 6}, 
and in Example 3.1.3.1 we showed the associated action table. It will be reproduced 
below. Imagine this was your model for understanding the behavior of some system 
when acted on by commands a and b. And suppose that a collaborator tells you that she 
has a more refined notion that fits with the same data. Her notion has 6 states rather 
than 3, but it's "compatible". What might that mean? 

Let's call the original state machine X and the new model Y. 



X: 



Y:-- 





The action tables for these two machines are: 



Original model X 


ID 


a 


b 


State 


State 1 


State 2 


State 1 


State 2 


State 1 


State 2 


State 


State 



Proposed model Y 


ID 


a 


b 


State 


State lA 


State 2A 


State lA 


State 2A 


State IB 


State IB 


State 2B 


State IC 


State IC 


State 2B 


State IB 


State 2A 


State 


State 


State 2B 


State 


State 



How are these models compatible? Looking at the table for F, if one removes the 
distinction between States lA, IB, IC and between States 2A and 2B, then one returns 
with the table for X. The table for Y is more specific, but it is fully compatible with 
table X. The sense in which it is compatible is precisely the sense defined by their being 
a natural transformation. 

Recall that Ad = (List(S), [ ], ^ ) is a monoid, and that a monoid is simply a category 
with one object, say Oh{Ai) = {a} (see Section 4.2.1). With S = {«, the monoid Ai 
can be visualized as follows: 



M 



Recall also that a state machine on A4 is simply a functor A4 Set. We thus have 
two such functors, X and Y. A natural transformation a : Y ^ X would consist of a 
component am for every object m E Ob(A1), such that certain diagrams commute. But 
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Ad having only one object, we need only one function - Y{^) ^{^)^ where Y{a) is 
the set of (6) states of Y and X(a) is the set of (3) states of X. 

The states of Y have been named so as to make the function particularly easy to 
guess. We need to check that two squares commute: 



r(A) 

Y{a) 

r(A) 



X(A) 

X(a) 
X{A) 



Y{A) 

Y(b) 

r(A) 



X(b) 
-X{A) 



(4.10) 



This can only be checked by going through and making sure certain things match, as 
specified by (4.10); we spell it out in gory detail. The columns that should match are 
those whose entries are written in blue. 



Naturality square for a: A ^ A 


r(A) [ID] 


Y{a) 


o Y{a) 




X{a) o 


State 


State lA 


State 1 


State 


State 1 


State lA 


State 2A 


State 2 


State 1 


State 2 


State IB 


State 2B 


State 2 


State 1 


State 2 


State IC 


State 2B 


State 2 


State 1 


State 2 


State 2A 


State 


State 


State 2 


State 


State 2B 


State 


State 


State 2 


State 



(4.11) 



Naturality square for 6: A ^ A 


r(A) [ID] 


Y{b) 


o Y{b) 




X{b)oa^ 


State 


State 2A 


State 2 


State 


State 2 


State lA 


State IB 


State 1 


State 1 


State 1 


State IB 


State IC 


State 1 


State 1 


State 1 


State IC 


State IB 


State 1 


State 1 


State 1 


State 2A 


State 


State 


State 2 


State 


State 2B 


State 


State 


State 2 


State 



(4.12) 



In reality we need to check that for every morphism in A^, such as [a, a, 6], a similar 
diagram commutes. But this holds automatically. For example (fiipping the naturality 
square sideways for typographical reasons) 



F(a) ^ F(a) ^ F(a) ^ F(a) 



X(A) 



X{a) 



X(A) 



X{a) 



X(A) 



Xib) 



■X{A) 



Since each small square above commutes (as checked by tables 4.11 and 4.12), the big 
outer rectangle commutes too. 



-•^^The function a^: Y(a.) makes the following assignments: State ^ State 0, State lA 

State 1, State IB ^ State 1, State IC ^ State 1, State 2A ^ State 2, State 2B ^ State 2. 
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To recap, the notion of compatibility between Y and X is one that can be checked 
and agreed upon by humans, but doing so it is left implicit, and it may be difficult to 
explain to an outsider what exactly was agreed to, especially in more complex situations. 
It is quite convenient to simply claim "there is a natural transformation from Y to X." 

00 

Exercise 4.3.1.3. Let F: C ^ P be a functor. Suppose someone said "the identity on F 
is a natural transformation from F to itself." 

a. ) What might they mean? 

b. ) If it is somehow true, what are the components of this natural transformation? 



Example 4.3.1.4. Let [1] g Ob(Cat) be the free arrow category described in Exercise 
4.1.2.28 and let V be any category. To specify a functor i^: [1] V requires the 
specification of two objects, F{vi)^ F{v2) e Oh{V) and a morphism F{e) : F{vi) F{v2) 
in V. The identity and composition formulas are taken care of once that much is specified. 
To recap, a functor F : [1] ^ P is the same thing as a morphism in V. 

Thus, choosing two functors F^G: [1] V is precisely the same thing as choosing 
two morphisms in V. Let us call them f : ao ^ ai and g bo ^ bi^ where to be clear we 
have / = F(e),ao = F{vo),ai = F{vi) and g = F(e),6o = G{vo),bi = G{vi). 

A natural transformation a: F ^ G consists of two components, ho := a^^ : ao ^ bo 
and hi := a^^ : ai ^ 6i, drawn as dashed lines below: 



ao 



ai - - ^ bi 

hi 



The condition for a to be a natural transformation is that the above square commutes. 

In other words, a functor [1] ^ P is an arrow in V and a natural transformation 
between two such functors is just a commutative square in V. 

Example 4.3.1.5. Recall that to any graph G we can associate the so-called paths-graph 
Paths(G), as described in Example 4.1.2.19. This is a functor Paths: Grph Grph. 
There is also an identity functor idcrph • Grph Grph. A natural transformation 
V' i^Grph Paths would consist of a graph homomorphism r]G ' ^<^Grph{G) Paths(G) 
for every graph G. But idGrph(G^) = G by definition, so we need rjc'- G ^ Paths(G). 
Recall that Paths (G) has the same vertices as G and every arrow in G counts as a path 
(of length 1). So there is an obvious graph homomorphism from G to Paths(G). It is 
not hard to see that the necessary naturality squares commute. 

Example 4.3.1.6. For any graph G we can associate the paths-graph Paths(G), and 
nothing stops us from doing that twice to yield a new graph Paths (Paths (G)). Let's 
think through what a path of paths in G is. It's a head-to-tail sequence of arrows in 
Paths(G), meaning a head-to-tail sequence of paths in G. These composable sequences of 
paths (or "paths of paths") are the individual arrows in Paths(Paths(G)). (The vertices 
in Paths (G) and Paths (Paths (G)) are the same as those in G, and all source and target 
functions are as expected.) 

Clearly, given such a sequence of paths in G, we could compose them to one big path in 
G with the same endpoints. In other words, there is graph morphism (iq • Paths(Paths(G)) 
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Paths(G), that one might cah "concatenation". In fact, this concatenation extends to a 
natural transformation 

/i : Paths o Paths Paths 

between functors Grph Grph. In Example 4.3.1.5, we compared a graph to its paths- 
graph using a natural transformation idcrph ^ Paths; here we are making a similar kind 
of comparison. 

Remark 4.3.1.7. In Example 4.3.1.5 we saw that there is a natural transformation sending 
each graph into its paths-graph. There is a formal sense in which a category is nothing 
more than a kind of reverse mapping. That is, to specify a category is the same thing 
as to specify a graph G together with a graph homomorphism Paths (G) G. The 
formalities involve monads, which we will discuss in Section 5.3. 

Example 4.3.1.8. For any set X, there is a set List(X) consisting of all lists whose 
entries are elements of X. In fact List: Set Set is a functor because for any function 
f: X ^ Y we can apply it entry-by-entry to a list of X's to get a list of F's (this was 
worked out in Exercise 4.1.2.17). But then we can compose List with itself to get a functor 
List o List : Set Set. Given X e Ob(Set), the set ListList(X) is the set of lists of lists. 
For example if X = {1, 2, 3} then ListList(X) contains [ [1, 2, 3, 2], [2, 2] , [ ], [3] ]. 

There is a natural transformation B: ListList List given by taking a list of 
lists and flattening it to a list. For example the above element would be sent under 
Bx- ListList(X) List(X) to [1,2,3,2,2,2,3]. It is not hard to see that the relevant 
naturality square commutes for any f.X^Y. 

Exercise 4.3.1.9. Let List: Set Set be as above. 

a. ) If someone said "singleton lists give a natural transformation from idset to List", 

what might they mean? 

b. ) Would they be correct? 



Exercise 4.3.1.10. Let X and Y be sets, and let /: X Y. There is a functor 
Cx • Grph Set that sends every graph to the set X and sends every morphism 
of graphs to the identity morphism id^ : X ^ X. This functor is called the constant 
functor at X. Similarly there is a constant functor Cy ' Grph Set. 

a. ) Use / to construct a natural transformation Cx Cy- 

b. ) What are its components? 



Exercise 4.3.1.11. For any graph (V, A, src, tgt) we can extract the set of arrows or the set 
of vertices. Since each morphism of graphs includes a function between their arrow sets 
and a function between their vertex sets, we actually have functors Ar: Grph Set 
and Ve: Grph Set. 

a. ) If someone said "taking source vertices gives a natural transformation from Ar to 

Fe", what natural transfromation might they be referring to? 

b. ) What are its components? 

c. ) If a different person, say from a totally different country, were to say "taking target 

vertices also gives a natural transformation from Ar to Fe," would they also be 
correct? 
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Example 4.3.1.12. [Graph homomorphisms are natural transformations] 

As discussed above (see Diagram 4.7), there is a category C for which a functor 
G: C ^ Set is the same thing as a graph. Namely, we have 




C :-- 



A natural transformation of two such functors a: G G' involves two components, 
aA'- G{A) G^{A) and ay - G{V) G\V)^ and two naturality squares, one for src 
and one for tgt. This is precisely the same thing as a graph homomorphism, as defined 
in Definition 3.3.3.1. 



4.3.2 Vertical and horizontal composition 

In this section we discuss two types of compositions for natural transformations. The 
terms vertical and horizontal are used to describe them; these terms come from the 
following pictures: 





We generally use o to denote both kinds of composition, but if we want to be very clear we 
will differentiate as follows: l3oa : F ^ H for vertical composition, and 72^71 ' F20F1 — > 
G2 o Gi for horizontal composition. Of course, the actual arrangement of things on a 
page of text does not correlate with verticality or horizontality — these are just names. 
We will define them more carefully below. 



4.3.2.1 Vertical composition of natural transformations 

The following proposition proves that functors and natural transformations (using ver- 
tical composition) form a category. 

Proposition 4.3.2.2. Let C and V he categories. There exists a category, called the 
category of functors from C to V and denoted Fun(C,P); whose objects are the functors 
C ^ V and whose morphisms are the natural transformations, 

llomYun{c,v){F^G) = {a: F ^ G \ a is Si natural transformation}. 

That is, there are identity natural transformations, natural transformations can be com- 
posed, and the identity and associativity laws hold. 

Proof. We showed in Exercise 4.3.1.3 that there for any functor F: C P, there 
is an identity natural transformation idF- F F (its component at c g Ob(C) is 
id^(,) : F(c) - F(c)). 

Given a natural transformation a: F ^ G and a natural transformation (3: G ^ 
we propose for the composite (3 o a the transformation ^y: F ^ H having components 
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Pc o <^c for every c E Ob(C). To see that 7 is indeed a natural transformation, one simply 
puts together naturality squares for a and /3 to get naturality squares for /3 o a. 

The associativity and identity laws for Fun(C, V) follow from those holding for mor- 
phisms in V. 

□ 

Notation 4.3.2.3. We sometimes denote the category Fun(C,P) by . 

Example 4.3.2.4. Recall from Exercise 4.1.2.35 that there is a functor Ob: Cat Set 
sending a category to its set of objects. And recall from Example 4.1.2.32 that there 
is a functor Disc: Set Cat sending a set to the discrete category with that set of 
objects (all morphisms in Disc{S) are identity morphisms). Let P: Cat Cat be the 
composition P = Disc o Ob. Then P takes a category and makes a new category with 
the same objects but no morphisms. It's like crystal meth for categories. 

Let idcat • Cat Cat be the identity functor. There is a natural transformation 
i: P ^ idcat- For any category C, the component ic : P{C) ^ C is pretty easily under- 
stood. It is a morphism of categories, i.e. a functor. The two categories P{C) and C 
have the same set of objects, namely Ob(C), so our functor is identity on objects; and 
P(C) has no non-identity morphisms, so nothing else needs be specified. 



Exercise 4.3.2.5. Let C = [•J be the category with Ob(C) = {A}, and llomc{A^ A) = 
{id^}. What is Fun(C, Set)? In particular, characterize the objects and the morphisms. 



Exercise 4.3.2.6. Let n g M and let n be the set with n elements, considered as a discrete 
category (i.e. we abuse notation and write n to mean what should really be called 
Disc{n)). Describe the category Fun(3, 2). 

Exercise 4.3.2.7. Let 1 denote the discrete category with one object, and let C be any 
category. 

a. ) What are the objects of Fun(l, C)? 

b. ) What are the morphisms of Fun(l, C)? 



Example 4.3.2.8. Let 1 denote the discrete category with one object (also known as the 
trivial monoid). For any category C, we investigate the category V := Fun(C,l). Its 
objects are functors C ^ 1. Such a functor F assigns to each object in C an object in 1 
of which there is one; so there is no choice in what F does on objects. And there is only 
one morphism in 1 so there is no choice in what F does on morphisms. The upshot is 
that there is only one object in P, let's call it F, in P, so P is a monoid. What are its 
morphisms? 

A morphism a: F F in P is a natural transformation of functors. For every 
c 6 Ob(C) we need a component ac'. F{c) F{c), which is a morphism 1 ^ 1 in 1. 
But there is only one morphism in 1, namely idi, so there is no choice about what these 
components should be: they are all idi. The necessary naturality squares commute, so 
a is indeed a natural transformation. Thus the monoid V is the trivial monoid; that is, 
Fun(C, 1) = 1 for any category C. 

Exercise 4.3.2.9. Let represent the discrete category on objects; it has no objects and 
no morphisms. Let C be any category. What is Fun(0, C)? 
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Exercise 4.3.2.10. Let [1] denote the free arrow category as in Exercise 4.1.2.28, and 
let C be the graph indexing category from (4.7). Draw the category Fun([l],C). Say 
whether any diagram you see commutes or not. 



4.3.2.11 Natural isomorphisms 

Let C and V be categories. We have defined a category Fun(C, V) whose objects are 
functors C ^ V and whose morphisms are natural transformations. What are the 
isomorphisms in this category? 

Lemma 4.3.2.12. LetC andV he categories and let F^G: C ^ V he functors. A natural 
transformation a: F ^ G is an isomorphism in Fun(C,P) if and only if the component 
ac'. F{c) G{c) is an isomorphism for each ohject c g Ob(C). 

Proof. First suppose that a is an isomorphism with inverse 13: G ^ F^ and let Pc • G{c) 
F[c) denote its c component. We know that a o p = idc and j3 o a = idp- Using the 
definitions of composition and identity given in Proposition 4.3.2.2, this means that for 
every c e Ob(C) we have ac o (3c = ^dc^c) /3c o ac = idi?(c); other words ac is an 
isomorphism. 

Second suppose that each ac is an isomorphism with inverse /3c: G{c) F{c). We 
need to see that these components assemble into a natural transformation; i.e. for every 
morphism f : c ^ c' in C the right-hand square 



F{c)- 

F(h) 

F{c') 



■G{c) 

G(h) 

■G{c') 



G{c) 

G(h) 

G{c') 



-F{c) 

F(h) 

F(c') 



commutes. We know that the left-hand square commutes because a is a natural trans- 
formation; we have labeled each square with a ? or a / accordingly. In the following 
diagram we want to show that the left-hand square commutes. We know that the middle 
square commutes. 



IdG(c) 




To complete the proof we need only to show that F{h) o pc = /3c' ^ G{h). This can be 
shown by a "diagram chase." We go through it symbolically, for demonstration. 



F{h) o pc = Pc o ac' o F{h) o pc = Pc o G{h) oacO/3c = Pc' o G{h). 



□ 
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Exercise 4.3.2.13. Recall from Application 4.3.1.2 that a finite state machine on alphabet 
S can be understood as a functor M Set, where M. = List(i;) is the free monoid 
generated by S. In that example we also discussed how natural transformations provide 
a nice language for changing state machines. Describe what kinds of changes are made 
by natural isomorphisms. 

4.3.2.14 Horizontal composition of natural transformations 

Example 4.3.2.15. Suppose that M = List(a, b) and = List(m, n,p) are free monoids, 
and let F: Ai' ^ be given by sending [m] ^ [a], [n] ^ [6], and [p] ^ [6, a, a]. An 
application of this might be if the sequence [6, a, a] was commonly used in practice and 
one wanted to add a new button just for that sequence. 

Recall Application 4.3.1.2. Let X\ M. ^ Set and Y : Ai ^ Set be the functors, and 
let a : y ^ X be the natural transformation found there. We reproduce them here: 





Original model X : A4 — >■ Set 


ID 


a 


b 


State 


State 1 


State 2 


State 1 


State 2 


State 1 


State 2 


State 


State 



Proposed model Y : A4 Set 


ID 


a 


b 


State 


State lA 


State 2A 


State lA 


State 2A 


State IB 


State IB 


State 2B 


State IC 


State IC 


State 2B 


State IB 


State 2A 


State 


State 


State 2B 


State 


State 



We can compose X and Y with F as in the diagram below 




Set 



to get functors Y o F and X o F, both of type Set. What would these be? 



X oF 


ID 


m 


n 


P 


State 


State 1 


State 2 


State 1 


State 1 


State 2 


State 1 


State 


State 2 


State 


State 


State 2 



Y oF 


ID 


m 


n 


P 


State 


State lA 


State 2A 


State lA 


State lA 


State 2A 


State IB 


State 


State IB 


State 2B 


State IC 


State 


State IC 


State 2B 


State IB 


State 


State 2A 


State 


State 


State 2A 


State 2B 


State 


State 


State 2A 



The map a is what sent both State lA and State IB in Y to State 1 in X, and 
so on. We can see that "the same a works now:" the p column of the table respects 
that mapping. But a was a natural transformation Y ^ X where as we need a natural 
transformation Y o F X o F. This is called whiskering. It is a kind of horizontal 
composition of natural transformation. 



■"^^The p-column comes from applying b then a then a, as specified above by F. 
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> V be functors, 
C (respectively 





Definition 4.3.2.16. Let S,C,P, and £ be categories, let Gi,G2: C 
and let a: Gi ^ G2 a natural transformation. Suppose that F: B - 
H: C ^ V) is di functor, depicted below: 

/ 

respectively, 

V 

Then the pre-whiskering of a by F, denoted a o F: Gi o F ^ G2 ^ F (respectively, the 
post-whiskering of a by denoted H o a: H o Gi H o G2) is defined as follows. 

For each b e Oh{B) the component {a o F)^: Gi o F{b) ^ G2 o F{b) is defined to 
be (Respectively, for each c e Ob(C) the component {H o a)c'. H o Gi(c) 

H o G2(c) is defined to be H{ac).) Checking that the naturality squares (in each case) 
is straightforward. 

The rest of this section can safely be skipped; I include it only for my own sense of 
completeness. 

Definition 4.3.2.17 (Horizontal composition of natural transformations). Let S,C, and 
V be categories, let ^1,^2: B ^ C and Gi, G2 : C ^ P be functors, and let a: Fi ^ F2 
and /3 : Gi ^ G2 be natural transformations, as depicted below: 



B 



F2 



V 



By pre- and post-whiskering in one order or the other we get the following diagram 

Gi o Fi > Gi o F2 



/3oFi 
G20F1 



G2OQ! 



/30F2 



^ G2 o F2 



It is straightforward to show that this diagram commutes, so we can take the composition 
to be our definition of the horizontal composition 

/3 oa: Gi o Fi ^ G2 o F2 . 

Remark 4.3.2.18. All of the above is somehow similar to the world of paths inside a 
database schema, as seen in Definition 3.5.2.3. The fact that our CPER is an equivalence 
relation is analogous to the fact that natural transformations are closed under vertical 
composition. Properties (3) and (4) in Definition 3.5.2.3 are analogous to pre- and post 
whiskering. See also Lemma 3.5.2.5. 

Theorem 4.3.2.19. 
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Given a setup of categories, functors, and natural transformations as above, we have 
(/32 o/3i)o(a2 oai) = (/32 00^2) o oai). 

Proof One need only observe that each square in the fohowing diagram commutes, so 
following the outer path (/32 o o (a2 o ai) yields the same morphism as following the 
diagonal path ; (/32 o 0^2) o o ai): 

Kjriri ^ Lxir2 ^ ^1^3 

/3ioFi 



^2 0Fi 


/320F2 




/32 0i^3 











— ^ G3^2 — ^ G3F3 

Gr30Q!l Gr30Q!2 



/3ioF2 



/3ioF3 



□ 



4.3.3 The category of instances on a database schema 

In Section 4.2.2 we showed that schemas are presentations of categories, and we will 
show in Section 4.4 that in fact the category of schemas is equivalent to the category 
of categories. In this section we therefore take license to blur the distinction between 
schemas and categories. 

If C is a schema, i.e. a category, then as we discussed in Section 4.2.2.5, an instance 
on C is a functor I: C ^ Set. But now we have a notion beyond categories and functors, 
namely that of natural transformations. So we make the following definition. 



Definition 4.3.3.1. Let C be a schema (or category). The category of instances on C, 
denoted C-Set, is Fun(C,Set). Its objects are C-instances (i.e. functors C Set) and 
its morphisms are natural transformations. 



Natural transformations should serve as some kind of morphism between instances on 
the same schema. How are we to interpret a natural transformation a: I ^ J between 
database instances /, J: C ^ Set? 

Our first clue comes from Application 4.3.1.2. There we considered the case of 
a monoid TW, and we thought about a natural transformation between two functors 
X, F: Ai Set, considered as different finite state machines. The notion of natural 
transformation captured the idea of one model being a refinement of another. This same 
kind of idea works for databases with more than one table (categories with more than 
one object), but the whole thing is a bit opaque. Let's work it through slowly. 



^Grapes 



An instance is a 



Example 4.3.3.2. Let us consider the terminal schema, 1 
functor 1 Set and it is easy to see that this is the same thing as just a set. A natural 
transformation a: / ^ J is a function from set / to set J. In the standard table view, 
we might have / and J as below: 
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Grapes (J) 



Grapes (/) 
ID 

Grape 1 
Grape 3 
Grape 4 

I Jan4-Ql 
Jan4-02 



Janl-01 
Janl-Q2 
Janl-Q3 
Janl-Q4 
Jan3-01 



There are 343 natural transformations I ^ J. Perhaps some of them make more sense 
than others; e.g. we could hope that the numbers in / corresponded to the numbers after 
the dash in J, or perhaps to what seems to be the date in January. But it could be that 
the rows in J correspond to batches, and all three grapes in / are part of the first batch 
on Jan-1. The notion of natural transformation is a mathematical one. 



Exercise 4.3.3.3. Recall the notion of set-indexed sets from Definition 2.7.6.12. Let A be 
a set, and come up with a schema A such that instances on A are A-indexed sets. Is our 
current notion of morphism between instances (i.e. natural transformations) well-aligned 
with the above definition of "mapping of A-indexed sets"? 

For a general schema (or category) C, let us think through what a morphism a: I ^ J 
between instances I^J: C ^ Set is. For each object c e Ob(C) there is a component 
ac'. I{c) J{c). This means that just like in Example 4.3.3.2, there is for each table c 
a function from the rows in /'s manifestation of c to the rows in J's manifestation of c. 
So to make a natural transformation, such a function has to be specified table by table. 
But then we have to contend with naturality squares, one for every arrow in C. Arrows 
in C correspond to foreign key columns in the database. The naturality requirement was 
already covered in Application 4.3.1.2 (and see especially how (4.10) is checked in (4.11) 
and (4.12)). 

Example 4.3.3.4. We saw in Section 4.2.1.19 that graphs can be regarded as functors 
Q Set, where Q is the "schema for graphs" shown here: 



G:- 



src Vertex 



tgt 



A database instance I: Q ^ Set on Q consists of two tables. Here is an example 
instance: 



/ : = 




Arrow (/) 


ID 


src 


tgt 


/ 


V 


w 


9 


w 


X 


h 


w 


X 



Vertex (/) 

Id 



w 
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To discuss natural transformations, we need two instances. Here is another, J: Q ^ Set, 



J: = 




Arrow (J) 



ID 


src 


tgt 


i 


q 


r 


j 


r 


s 


k 


s 


r 


i 


s 


t 



Vertex (J) 



ID 



To give a natural transformation a: / ^ J, we give two components: one for arrows 
and one for vertices. We need to say where each vertex in / goes in J and we need to 
say where each arrow in / goes in J. The naturality squares insist that if we specify 
that g ^ for example, then we better specify thati(; ^ r and that x ^ s. What a 
computer is very good at, but a human is fairly slow at, is checking that a given pair of 
components (arrows and vertices) really is natural. 

There are 8000 ways to come up with component functions aArrow and ^vertex, but 
precisely four natural transformations, i.e. four graph homomorphisms, / ^ J; the other 
7996 are haphazard fiingings of arrows to arrows and vertices to vertices without any 
regard to sources and targets. We briefly describe the four now. 

First off, nothing can be sent to u because arrows must go to arrows and u touches 
no arrows. If we send v ^ q then / must map to i, and w must map to r, and both g 
and h must map to j, and x must map to 5. If we send v ^ r then there are two choices 
for g and h. If we send v ^ s then there's one way to obtain a graph morphism. If we 
try to send v we fail. All of this can be seen by staring at the tables rather than 

at the pictorial representations of the graphs; the human eye understands these pictures 
better, but the computer understands the tables better. 



Exercise 4.3.3.5. If I,J: G - 
formations are there J ^ II 



Set are as in Example 4.3.3.4, how many natural trans- 





Exercise 4.3.3.6. Let Ya'- Q ^ Set denote the instance below: 



Arrow (Ya) 


ID 


src 


tgt 


a 


^0 





Vertex {Ya) 



ID 



^0 



Let I : Q ^ Set be as in Example 4.3.3.4. 

a.) How many natural transformations are there Ya 



b. ) With J as above, how many natural transformations are there Ya J? 

c. ) Do you have any conjecture about the way natural transformations Ya ^ X behave 

for arbitrary graphs X : Q ^ Set? 



In terms of databases, this notion of instance morphism / ^ J is fairly benign. For 
every table its a mapping from the set of rows in /'s version of the table to J's version 
of the table, such that all the foreign keys are respected. We will see that this notion of 
morphism has excellent formal properties, so that projections, unions, and joins of tables 
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(the typical database operations) would be predicted to be "obviously interesting" by a 
category theorist who had no idea what a database was. 

However, something is also missing from the natural transformation picture. A very 
important occurrence in the world of databases is the update. Everyone can understand 
this: a person makes a change in one of the tables, like changing your address from 
Cambridge, MA to Hereford, UK. Most such arbitrary changes of database instance are 
not "natural", in that the new linking pattern is incompatible with the old. 

It is interesting to consider how updates of C-instances should be understood category 
theoretically. We might want a category Updc whose objects are C-instances and whose 
morphisms are updates. But then what is the composition formula? Is there a unique 
morphism I ^ J whenever J can be obtained as an update on /? Because in that case, 
we would be defining Updc to be the indiscrete category on the set of C-instances (see 
Example 4.3.4.3). 

Exercise 4.3.3.7. Research project: Can you come up with a satisfactory way to model 
database updates category-theoretically? Let M be the category 

[M]:= • 

representing a discrete timeline. A place to start might be to use something like the slice 
category Cat/[iq] where the fiber over each object in M is a snapshot of the database in 
time. Can you make this work? 

4.3.4 Equivalence of categories 

We have a category Cat of categories, and in every category their is a notion of isomor- 
phism between objects: one morphism each way, such that each round-trip composition 
is the identity. An isomorphism in Cat, therefore, takes place between two categories, 
say C and V: it is a functor F: C ^ V and a functor G: V ^ C such that G o F = idc 
and F o G = id-z) . 

It turns out that categories are often similar enough to be considered equivalent 
without being isomorphic. For this reason, the notion of isomorphism is considered "too 
strong" to be useful for categories. The feeling to a category theorist might be akin to 
saying that two material samples are the same if there is an atom-by-atom matching, or 
that two words are the same if they are written in the same font, of the same size, by 
the same person, in the same state of mind. 

As reasonable as isomorphism is as a notion in most categories, it fails to be the 
"right notion" about categories. The reason is that in categories there are objects and 
morphisms, whereas when we talk about categories, we have categories and functors, plus 
natural transformations. These serve as mappings between mappings, and this is not 
part of the structure of an ordinary category. In cases where a category C does have such 
mappings between mappings, it is often a "better notion" if we take that extra structure 
into account, like we will for categories. This whole subject leads us to the study of 
2-categories (or n-categories, or oo-categories), which we do discuss in this book. See, 
for example, [Lei] for an introduction. 

Regardless, our purpose now is to explain this "good notion" of sameness for cat- 
egories, namely equivalences of categories^ which appropriately take natural transfor- 
mations into account. Instead of "functors going both ways with round trips equal to 



-•^^More precisely, given a functor between schemas F : C ^ V, the pullback Ap : V-Set C-Set, its 
left Ei? and its right adjoint Hp constitute these important queries. See Section 5.1.4. 
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identity", which is required in order to be an isomorphism of categories, equivalence of 
categories demands "functors going both ways with round trips isomorphic to identity". 

Definition 4.3.4.1. Let C and C be categories. A functor F: C ^ C is called an 
equivalence of categories, and denoted F: C ^ if there exists a functor : ^ C 

and natural isomorphisms a: idc ^ F' o F and a' : idc ^ F o F' . In this case we say 
that F and F' are mutually inverse equivalences. 

Unpacking a bit, suppose we are given functors F \ C ^ C and F' \ C C. We want 
to know something about the roundtrips on C and on we want to know the same 
kind of information about each roundtrip, so let's concentrate on the C side. We want 
to know something about F' o F: C ^ so let's name it i: C ^ C; we want to know 
that z is a natural isomorphism. That is, for every c e Ob(C) we want an isomorphism 
ac : c ^ i(c), and we want to know that these isomorphisms are picked carefully enough 
that given g: c ^ c' mC, the choice of isomorphisms for c and c' are compatible, 

C ^ i[c) 

9 i{9) 

To be an equivalence, the same has to hold for the other roundtrip, i' = F o F' : C ^ C . 

Exercise 4.3.4.2. Let C and C be categories. Suppose that F: C ^ CMs an isomorphism 
of categories. 

a. ) Is it an equivalence of categories? 

b. ) What are the components of a and a' (with notation as in Definition 4.3.4.1)? 



Example 4.3.4.3. Let 5 be a set and lei S x S ^ S x S he the complete relation on 
which is a preorder Ks- Recall from Proposition 4.2.1.17 that we have a functor 
i \ PrO Cat, and the resulting category i{Ks) is called the indiscrete category on S; 
it has objects S and a single morphism between every pair of objects. Here is a picture 

of ^{1,2,3}- 

1 2 



It is easy check that i^i, the indiscrete category on one element, is isomorphic to 1, 
the discrete category on one object, also known as the terminal category (see Exercise 
4.1.2.34). The category 1 consists of one object, its identity morphism, and nothing else. 

-•^^The notation ~ has already been used for equivalences of paths in a schema. We do not mean to 
equate these ideas; we are just reusing the symbol. Hopefully no confusion will arise. 
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The only way that Ks can be isomorphic to 1 is if 5* has one element. On the 
other hand, there is an equivalence of categories 

for every set 6* 7^ 0. 

In fact, there are many such equivalences, one for each element of S. To see this, 
let 5 be a nonempty set and choose an element sq e S. For every s e there is a 
unique isomorphism ks'. s ^ sq in Ks. Let F: Ks ^ 1 be the only possible functor 
(see Exercise 4.1.2.34), and let F^ : 1^ Ks send the unique object in 1 to the object sq. 

Note that o F = idi : 1 ^ 1 is the identity, but that F o F' : Ks ^ Ks sends 
everything to sq. Let a = idi and define a' : id^^ F o F' hy a'^ = ks. Note that is 
an isomorphism for each s E Ob(i^5'), and note that a' is a natural transformation (hence 
natural isomorphism) because every possible square commutes in Ks. This completes 
the proof, initiated in the paragraph above, that the category Ks is equivalent to 1 for 
every nonempty set and that this fact can be witnessed by any element sq e S. 

Example 4.3.4.4. Consider the category FLin, described in Exercise 4.1.1.8, of finite 
nonempty linear orders. For every natural number n G M, let [n] e Ob(FLin) denote the 
linear order shown in Example 3.4.1.6. Define a category A whose objects are given by 
Ob(A) = {[n] I n G M} and with HomA([^], [n]) = HomFLin([^], M)- The difference 
between FLin and A is only that objects in FLin may have "funny labels", e.g. 

5 X ^^Sam'' 
• ^ • ^ • 

whereas objects in A all have standard labels, e.g. 

012 
• ^ • ^ • 

Clearly FLin is a much larger category, and yet feels like it is "pretty much the same 
as" A. Justly, they are equivalent, FLin ^ A. 

The functor F^ : A ^ FLin is the inclusion; the functor F : FLin A sends every 
finite nonempty linear order X e Ob(FLin) to the object F{X) := [n] e A, where 
Oh{X) ^ {1,2,. ..,n}. For each such X there is a unique isomorphism ax : X [n], 
and these fit together into the required natural isomorphism idFLin F' o F. The 
other natural isomorphism a' : id^ F o F' is the identity. 

Exercise 4.3.4.5. Recall from Definition 2.1.2.15 that a set X is called finite if there 
exists a natural number n g M and an isomorphism of sets X ^ n. Let Fin denote the 
category whose objects are the finite sets and whose morphisms are the functions. Let 
S denote the category whose objects are the sets n and whose morphisms are again the 
functions. For every object X e Ob(Fin) there exists an isomorphism px : X ^ n for 
some unique object n e Oh{S). Find an equivalence of categories Fin ^ S. 

Exercise 4.3.4.6. We say that two categories C and V are equivalent if there exists an 
equivalence of categories between them. Show that the relation of "being equivalent" is 
an equivalence relation on Ob (Cat). 

-•^^One way to see this is that by Exercise 4.1.2.35, we have a functor Ob: Cat Set, and we know 
by Exercise 4.1.2.21 that functors preserve isomorphisms, so an isomorphism between categories must 
restrict to an isomorphism between their sets of objects. The only sets that are isomorphic to 1 have 
one element. 

-•^^The phrase "these fit together into" is suggestive shorthand for, and thus can be replaced with, the 
phrase "the naturality squares commute for these components, so together they constitute". 
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Example 4.3.4.7. Consider the group Z2 := ({0, 1}, 0, +), where 1 + 1 = 0. As a category, 
Z2 has one object A and two morphisms, namely 0, 1, such that is the identity. Since 
Z2 is a group, the morphism 1 : A ^ A must have an inverse x, meaning 1 + x = 0, and 
X = 1 is the only solution. 

The point is that the morphism 1 in Z2 is an isomorphism. Let C = 1 be the terminal 
category as in Exercise 4.1.2.34. One might accidentally believe that C is equivalent 
to Z2, but this is not the case! The argument in favor of the accidental belief is that 
we have unique functors F: Z2 C and : C ^ Z2 (and this is true); the roundtrip 
F o F' : C ^ C is the identity (and this is true); and for the roundtrip o F: Z2 — > Z2 
both morphisms in Z2 are isomorphisms, so any choice of morphism a^: A ^ F' o F{a) 
will be an isomorphism (and this is true). The problem is that no such will be a 
natural transformation. 

When we roundtrip F' o F: Z2 ^ Z2, the image of 1 : A ^ A is o F{1) = = id^. 
So the naturality square for the morphism 1 looks like this: 



0=F'oF(l) 



where we still haven't decided whether we want to be or 1. Unfortunately, neither 
choice works (i.e. for neither choice will the diagram commute) because x ^ x -\-0 in 
Z2. 

Definition 4.3.4.8 (Skeleton). Let C be a category. We saw in Lemma 4.1.1.16 that 
the relation of "being isomorphic" is an equivalence relation = on Ob(C). An election in 
C is a choice E of the following sort: 




for each ^-equivalence class S ^ Ob(C) a choice of object se ^ called the elected 
object for and 

for each object c e Ob(C) a choice of isomorphism ic'. se ^ c and jc - c ^ se with 
o jc = idc and jc o ic = id^^, where is an elected object. 



Given an election E in C, there is a category called the E -elected skeleton of denoted 
Skel£;(C), whose objects are the elected objects and whose morphisms 5 ^ t for any 
elected objects s^t e Ob(C) are given by Homskei^(^(j)(5, t) = Homc(s,t). Any object 
c G Ob(C) is isomorphic to a unique elected object 5^; we refer to 5^; as the elected rep- 
resentative of c; we refer to the isomorphisms ic and jc as the representing isomorphisms 
for c. 

Proposition 4.3.4.9. Let C he a category and let E he an election in C. There is an 
equivalence of categories 

Skel£;(C) ^C. 

Proof. The functor F' : Skel£;(C) ^ C is the inclusion. The functor F:C ^ Skel£;(C) 
sends each object in C to its elected representative. Given objects c, c' g Ob(C) with 
elected representatives 5, t respectively, and given a morphism g: c ^ cMn C, let ic^jc^ 
and jc' be the representing isomorphisms, and define F(g) : s ^ t to he the composite 
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This is functorial because it sends the identity to the identity and F{gog') = F{g)oF{g'). 

The composite FoF': Skel£;(C) Skel£;(C) is the identity. For each c e Ob(C) define 
Qfc • c ^ F' o F{c) hy ac jc- Given g: c ^ c' the required naturahty square is shown 
to the left below: 



jc ic 

c ^ s ^ c 





? 


F'oF{g) 




Y 









c' ^ t ^ c' 

j'c j'c 

The right-hand part commutes by definition of F and F'\ i.e. j' o goi^ = F' o F{g). The 
left-hand square commutes because ic o = idc- 

□ 

Definition 4.3.4.10. A skeleton o/C is a category <S, equivalent to C, such that for any 
two objects s, e 0b(5), if s ^ then s = s\ 

Exercise 4.3.4.11. Let P be a preorder (considered as a category). 

a. ) If T^Ms a skeleton of is it a partial order? 

b. ) Is every partial order the skeleton of some preorder? 



Definition 4.3.4.12. Let C and V be categories, and let F: C P be a functor. 
For any two objects c, e Ob(C), we have a function Homi7(c, c'): Home (c,c^) 
Homx^(F(c), F(c')) guaranteed by the definition of functor. We say that F is a full 
functor if Homj? (c,c') is surjective for every c, c'. We say that F is a faithful functor 
if Homi?(c, c') is injective for every c, c'. We say that F is a fully faithful functor if 
Homi?(c, c') is bijective for every c, c^ 

Exercise 4.3.4.13. Let 1 and 2 be the discrete categories on one and two objects, respec- 
tively. There is only one functor 2^1. 

a. ) Is it fuh? 

b. ) Is it faithful? 



Exercise 4.3.4.14. Let denote the empty category, and let C be any category. There is 
a unique functor F: ^ C. 

a. ) For general C will F be full? 

b. ) For general C will F be faithful? 

c. ) For general C will F be an equivalence of categories? 



Proposition 4.3.4.15. Let C and C he categories and let F: C ^ C he an equivalence 
of categories. Then F is fully faithful. 
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Proof. Suppose F is an equivalence, so we can find a functor F' : C ^ C and natural 
isomorphisms a\ idc ^ F' o F and a' : idc ^ F o F' . We need to know that for any 
objects c, d G Ob(C), the map 

Homi7^(c, d) : Homc(c, d) Homc/(Fc, Fd) 

is bijective. Consider the following diagram 

Homi?(c,d) 

Homc(c, d) ^ Rome [Fc, Fd) 




UomF{F'Fc,F'Fd) 



The fact that a is bijective implies that the vertical function is surjective. The fact that 
a' is bijective implies that the vertical function is injective, so it is bijective. This implies 
that Homi?(c, is bijective as well. 

□ 

Exercise 4.3.4.16. Let Z2 be the group (as category) from Example 4.3.4.7. Are there 
any fully faithful functors Z2 1? 



4.4 Categories and schemas are equivalent. Cat ^ Sch 

Perhaps it is intuitively clear that schemas are somehow equivalent to categories, and in 
this section we make that precise. The basic idea was already laid out in Section 4.2.2. 

4.4.1 The category Sch of schemas 

Recall from Definition 3.5.2.6 that a schema consists of a pair C := (G, where G = 
{V, A, src^tgt) is a graph and ^ is a categorical path equivalence relation (or CPER), 
meaning a kind of equivalence relation on the paths in G (see Definition 3.5.2.3. If 
we think of a schema as being analogous to a category, what should fulfill the role of 
functors? That is, what are to be the morphisms in Sch? 

Unfortunately, ones first guess may give the wrong notion if we want an equivalence 
Sch ^ Cat. Since objects in Sch are graphs with additional structure, one might imagine 
that a morphism C ^ C in Sch should be a graph homomorphism (as in Definition 
3.3.3.1) that preserves said structure. But graph homo morphisms require that arrows be 
sent to arrows, whereas we are more interested in paths than in individual arrows — the 
arrows are merely useful for presentation. 

If instead we define morphisms between schemas to be maps that send paths in C to 
paths in subject to the requirements that path endpoints, path concatenations, and 
path equivalences are preserved, this will turn out to give the correct notion. And since 
a path is a concatenation of its arrows, it suffices to give a function F from the arrows of 
C to the paths of C, which automatically takes care of the first two requirements above; 
we must only take care that F preserves path equivalences. 
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Recall from Examples 4.1.2.19 and 4.3.1.6 the paths-graph functor Paths: Grph 
Grph, the paths of paths functor Paths o Paths: Grph Grph, and the natural 
transformations for any graph G, 

m-G^ Paths(G) and ' Paths(Paths(G)) ^ Paths(G). (4.13) 

The function rjc spells out the fact that every arrow in G counts as a path in G, and the 
function jiQ spells out the fact that a head-to-tail sequence of paths (a path of paths) in 
G can be concatenated to a single path in G. 

Exercise 4.4.1.1. Let [2] denote the linear order graph of length 2 (see Exercise 4.1.1.8 
and Proposition 4.1.2.6), and let Coop denote the unique graph having one vertex and 
one arrow (pictured in Diagram (3.15)). 

a. ) Find an injective graph homomorphism /: [2] ^ Paths(£oop). 

b. ) The graph [2] has 6 paths, so Paths([2]) has 6 arrows. What are the images of these 

arrows under the graph homomorphism Paths(/) : Paths([2]) Paths(Paths(£oop))? 



We are almost ready to give the definition of schema morphism, but before we do, let's 
return to our original idea. Given graphs G, G' (underlying schemas C^C) we originally 
wanted a function from the paths in G to the paths in G^ but we realized it was more 
concise to speak of a function from arrows in G to paths in G^ How do we get back 
what we originally wanted from the concise version? Given a graph homomorphism 
/: G ^ Paths(G'), we use (4.13) to form the following composition, which we denote 
simply by Paths/ : Paths(G) ^ Paths(GO: 

Paths(G) Paths(Paths(GO) — Paths(GO (4.14) 

This says that given a function from arrows in G to paths in G^ a path in G becomes 
a path of paths in G^ which can be concatenated to a path in G' . This simply and 
precisely spells out our intuition. 

Definition 4.4.1.2. Let G = {V, A, src.tgt) and G' = {V , A' , src' ,tgt') be graphs, and 
let C = (G, ^g) and C = (G^ ^g') be schemas. A schema morphism F from C to P, 
denoted F: C ^ V is a graph homomorphism 

F: G^Paths(GO 

that satisfies the following condition for any paths p and q in G: 

if p ^G Q then Paths/ (p) ^g' Paths/ (g). (4.15) 

Two schema morphisms E^F: C C are considered identical if they agree on vertices 
(i.e. ^0 = Fq) and if, for every arrow / in G, there is a path equivalence in G' 

We now define the category of schemas^ denoted Sch, to be the category whose objects 
are schemas as in Definition 3.5.2.6 and whose morphisms are schema morphisms defined 
as above. The identity morphism on schema C = (G, ^g) is the schema morphism 
idc '= r]G' G ^ Paths(G) as defined in Equation (4.13). 

-•^^By Definition 3.3.3.1, a graph homomorphism F : G ^ Paths(C) will consist of a vertex part 
Fq: V and an arrows part Fi: E ^ Path(GO. See also Definition 3.3.2.1. 
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Slogan 4A.1.3. 

" A schema morphism sends vertices to vertices, arrows to paths, and path 
equivalences to path equivalences. " 

Example 4.4.1.4. Let [2] be the linear order graph of length 2, pictured to the left, and 
let C denote the schema pictured to the right below: 



[2]: = 



C : = 




We impose on C the path equivalence declaration [g^h] ^ [i] and show that in this case 
C and [2] are isomorphic in Sch. We have a schema morphism i^: [2] C sending 
^ a, 1 ^ 6, 2 ^ c, and sending each arrow in [2] to an arrow in C And we have 
a schema morphism F' : C ^ [2] which reverses this mapping on vertices; note that 
F' must send the arrow i in C to the path [/i,/2] in [2], which is ok! The roundtrip 
F' o F:[2\ ^ [2] is identity. The roundtrip F o F' : C ^ C may look like it's not 
the identity; indeed it sends vertices to themselves but it sends i to the path \g^h\. 
But according to Definition 4.4.1.2, this schema morphism is considered identical to idc 
because there is a path equivalence \dc{i) = [i] ^ [g^h] = F o F'{i). 
Exercise 4.4.1.5. Consider the schema [2] and the schema C pictured above, except where 
this time we do not impose any path equivalence declarations on C, so [^, h\ [i\ in our 
current version of C. 

a. ) How many schema morphisms are there [2] C that send to a? 

b. ) How many schema morphisms are there C [2] that send a to 0? 



Exercise 4.4.1.6. Consider the graph jCoop pictured below 



jCoop : = 



and for any natural number n, let Cn denote the schema {Coop^ ^n) where is the 
PED - This is the "finite hierarchy" schema of Example 3.5.2.11. Let 1 denote 
the graph with one vertex and no arrows; consider it as a schema. 

a. ) Is 1 isomorphic to Li in Sch? 

b. ) Is it isomorphic to any (other) L^? 



Exercise 4.4.1.7. Let Coop and Cn be the schemas defined in Exercise 4.4.1.6. 

a. ) What is the cardinality of the set Homsch(>C3, >C5)? 

b. ) What is the cardinality of the set llomsch{^5^ Cs)7 Hint: the cardinality of the set 

Homsch(>C4,>C9) is 8. 
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4.4.2 Proving the equivalence 

Construction 4.4.2.1 (From schema to category). We will define a functor L: Sch 
Cat. Let C = (G, ^) be a categorical schema, where G = (V, A, src, tgt). Define L{C) to 
be the category with Ob(L(C)) = and with Hom^(c)('Ui, ^'2) PathG'(i', k;)/ ^, i.e. 
the set of paths in G, modulo the path equivalence relation for C. The composition of 
morphisms is defined by concatenation of paths, and Lemma 3.5.2.5 ensures that such 
composition is well-defined. We have thus defined L on objects of Sch. 

Given a schema morphism F: C C ^ where C = (G^^^), we need to produce 
a functor L{F): L{C) L{C'). The objects of L{C) and L(C') are the vertices of G 
and G' respectively, and F provides the necessary function on objects. Diagram (4.14) 
provides a function Paths^?: Paths(G) Paths(G') will provide the requisite function 
for morphisms. 

A morphism in L{C) is an equivalence class of paths in C. For any representative path 
p G Paths(G), we have Pathsi7(j9) g Paths(G'), and Hp ^ q then Pathsi?(j9) ^' VdXhsF^q) 
by condition 4.15. Thus Pathsi? indeed provides us with a function YioiaL(^c) ~^ Homj^(c/). 
This defines L on morphisms in Sch. It is clear that L preserves composition and 
identities, so it is a functor. 

Construction 4.4.2.2 (From category to schema). We will define a functor R: Cat 
Sch. Let C = ( Ob (C), Home, dom, cod, id, o) be a category (see Exercise 4.1.1.18). Let 
R{C) = (G, ^) where G is the graph 

G = ( Ob (C), Home, dom, cod), 

and with ^ defined as the CPER generated by the following path equivalence decla- 
rations: for any composable sequence of morphisms /i,/2,...,/n (with dom{fi-^i) = 
cod{fi) for each l^z^n — l)we put 

[/l,/2,...,/n] ^ [/n0---0/20/i]. (4.16) 

This defines R on objects of Cat. 

A functor F: C ^ V induces a schema morphism R{F): R{C) R{^)-, because 
vertices are sent to vertices, arrows are sent to arrows (as paths of length 1), and path 
equivalence is preserved by (4.16) and the fact that F preserves the composition formula. 
This defines R on morphisms in Cat. It is clear that R preserves compositions, so it is 
a functor. 

Theorem 4.4.2.3. The functors 

L: Sch 1 ^ Cat : R 

are mutually inverse equivalences of categories. 

Sketch of proof It is clear that there is a natural isomorphism a: idcat ^ L o R; i.e. 
for any category C, there is an isomorphism C = L{R{C)). 

Before giving an isomorphism /3: idsch ^ R o we briefly describe R{L{S)) =: 
(G^ ^0 for a schema S = (G, ^). Write G = (F, A, src, tgt) and G' = (V^ A^ src\ tgt'). 
On vertices we have V = V . On arrows we have A' = Pathc/ ^. The CPER ^' for 
R{L{S)) is imposed in (4.16). Under every path of paths in G is made equivalent to 
its concatenation, considered as a path of length 1 in G^ 

There is a natural transformation p : idsch Ro L whose <S-component sends each 
arrow in G to a certain path of length 1 in G^ We need to see that (3s has an inverse. 
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But this is straightforward: every arrow f in Ro L{S) is an equivalence class of paths 
in 5; choose any one and send / there; by Definition 4.4.1.2 any other choice will give 
the identical morphism of schemas. It is easy to show that the roundtrips are identities 
(again up to the notion of identity given in Definition 4.4.1.2). 

□ 

4.5 Limits and colimits 

Limits and colimits are universal constructions, meaning they represent certain ideals of 
behavior in a category. When it comes to sets that map to A and 5, the {A x 5)-grid is 
ideal — it projects on to both A and B as straightforwardly as possible. When it comes 
to sets that can interpret the elements of both A and the disjoint union A u B is 
ideal — it includes both A and B without confusion or superfiuity. These are limits and 
colimits in Set. Limits and colimits exist in other categories as well. 

Limits in a preorder are meets, colimits in a preorder are joins. Limits and colimits 
also exist for database instances and monoid actions, allowing us to discuss for example 
the product or union of different state machines. Limits and colimits exist for spaces, 
giving rise to products and unions, as well as quotients. 

Limits and colimits do not exist in every category; when C is complete with respect 
to limits (or colimits), these limits always seem to mean something valuable to human 
intuition. For example, when a subject has already been studied for a long time before 
category theory came around, it often turns out that classically interesting constructions 
in the subject correspond to limits and colimits in C. For example products, unions, 
equivalence relations, etc. are classical ideas in set theory that are naturally captured 
by limits and colimits in Set. 

4.5.1 Products and coproducts in a category 

In Sections 2.4, we discussed products and coproducts in the category Set of sets. Now 
we discuss the same notions in an arbitrary category. For both products and coproducts 
we will begin with examples and then write down the general concept, but we'll work on 
products first. 

4.5.1.1 Products 

The product of two sets is a grid, which projects down onto each of the two sets. This 
is good intuition for products in general. 

Example 4.5.1.2. Given two preorders, A'l := (Xi, ^i) and A2 := (X2, ^2), we can take 
their product and get a new preorder A'l x A'2. Both A'l and have underlying sets 
(namely Xi and X2), so we might hope that the underlying set of A'l x is the set 
Xi X X2 of ordered pairs, and this turns out to be true. We have a notion of less-than on 
A'l and we have a notion of less-than on A2 ; we need to construct a notion of less-than 
on A'l X A2. So, given two ordered pairs (xi,X2) and (x'^,X2), when should we say that 
(xi,X2) ^1,2 (^17^2) holds? The obvious guess is to say that it holds iff both xi x[ 
and X2 ^2 ^2 hold, and this works: 

A'l xX2:={Xi xX2,^i,2) 

Note that the projection functions Xi x X2 Xi and Xi x X2 X2 induce 
morphisms of preorders. That is, if (xi,X2) ^1,2 {x[^X2) then in particular xi ^ x[. So 
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Exercise 4.5.1.3. Suppose that you have a partial order (S*, ^5) on songs (so you know 
some songs are preferable to others but sometimes you can't compare). And suppose 
you have a partial order (A, ^a) on pieces of art. You're about to be given a pair (s, a) 
including a song and a piece of art. Does the product partial order S x A provide a 
reasonable guess for your preferences on pairs? 

Exercise 4.5.1.4. Consider the partial order ^ on M given by standard "less-than-or- 
equal-to", so 5 ^ 9 etc. And consider another partial order, divides on M, where 
a divides b if "a goes into b evenly", i.e. if there exists n G M such that a * n = 6, so 
5 divides 35. If we call the product order (X, <) := (N, ^) x (M, divides), which of 
the following are true: 

(2, 4) < (3, 4)? (2, 4) < (3, 5)? (2, 4) < (8, 0)? (2, 4) < (0, 0)? 



Example 4.5.1.5. Given two graphs Gi = (Vi, Ai, srci, t^ti) and G2 = {V2^ src2^tgt2)^ 
we can take their product and get a new graph Gi x G2. The vertices will be the grid 
of vertices Vi x V2, so each vertex in Gi x G2 is labeled by a pair of vertices, one from 
Gi and one from G2. When should an arrow connect {vi^V2) to {v[^V2)^. Whenever we 
can find an arrow in Gi connecting vi to v'^ and we can find an arrow in G2 connecting 
V2 to It turns out there is a simple formula for the set of arrows in Gi x G2, namely 
Ai X A2. 

Let's write G := Gi x G2 and say G = (F, A, src, tgt). We now know that V = V1XV2 
and A = Ai x A2. What should the source and target functions A ^ V be? Given a 
function srci : Ai ^ Vi and a function src2 : A2 ^ ¥2^ the universal property of products 
in Set (Lemma 2.4.1.10) provides a unique function 

src := srci x src2 : Ai x A2 ^ Vi x V2 

Namely the source of arrow (ai,a2) will be the vertex {srci{ai)^ src2{a2)). Similarly we 
have a ready-made choice of target function tgt = tgti x tgt2- We have now defined the 
product graph. 
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Here's a concrete example. Let / and J be as drawn below: 



/ : = 




Arrow (/) 


ID 


src 


tgt 


/ 


V 


w 


9 


w 


X 


h 


w 


X 



Vertex (/) 



ID 




Arrow (J) 


ID 


src 


tgt 


i 


q 


r 


3 


r 


s 


k 


s 


r 


i 


s 


t 



Vertex (J) 



ID 



The product I x J drawn below has, as expected 3*4 = 12 vertices and 3*4 = 12 arrows: 



I X J :-- 











• 


• 


• 


• 










{w,q) 


{w,r) 


(w,s) 




• 






• 


{x,q) 








• 


• 


• 


• 



Arrow (I X J) 


ID 


src 


tgt 








(/, j) 


(t; , r) 


( w , s) 


(/, fc) 






(/,^) 


(t; , s) 


( w , t) 


(<?,i) 


(w, q) 




(9, j) 






(<?, fc) 


, s) 


(a3 , r) 






(x, t) 


(h, i) 


(w , q) 


(x , r) 


j) 






(h,, fc) 


, s) 


(x , r) 






(x, t) 



Vertex (J X J) 



Here is the most important thing to notice. Look at the Arrow table for / x J, 
and for each ordered pair, look only at the second entry in all three columns; you will 
see something that matches with the Arrow table for J. Do the same for /, and again 
you'll see a perfect match. These "matchings" are readily-visible graph homomorphisms 
I X J ^ I and / X J ^ J in Grph. 



0/1 

Exercise 4.5.1.6. Let [1] = • — >• be the linear order graph of length 1 and let P 
Paths([l]) be its paths-graph, as in Example 4.1.2.19 (so P should have three arrows 
and two vertices). Draw the graph P x P. 

Exercise 4.5.1.7. Recall from Example 3.5.2.9 that a discrete dynamical system (DDS) 
is a set s together with a function /: s ^ s. By now it should be clear that if 



Coop : = 




is the loop schema, then a DDS is simply an instance (a functor) /: Coop Set. We 
have not yet discussed products of DDS's, but perhaps you can guess how they should 
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work. For example, consider the instances /, J: Coop Set tabulated below: 



s (I) 


ID 


f 


A 


c 


B 


c 


C 


c 



s (J) 


ID 


f 


X 


y 


y 


X 


z 


z 



a. ) Make a guess and tabulate I x J. Then draw it.^^ 

b. ) Recall the notion of natural transformations between functors (see Example 4.3.3.4), 

which in the case of functors Coop Set are the morphisms of instances. Do you see 
clearly that there is a morphism of instances I x J ^ I and I x J ^ Jl Just check 
that if you look only at the left-hand coordinates in your I x you see something 
compatible with /. 



In every case above, what's most important to recognize is that there are projection 
maps IxJ^I and IxJ^ J, and that the construction of / x J seems as straightforward 
as possible, subject to having these projections. It is time to give the definition. 

Definition 4.5.1.8. Let C be a category and let X, y g Ob(C) be objects. A span on X 
and Y consists of three constituents (Z^p^q)^ where Z e Ob(C) is an object, and where 
p: Z ^ X and q: Z ^Y are morphisms in C. 



X 



Y 



A product of X and F is a span X X xY - 
X <^ Z Y there exists a unique morphism tp^q 
below commutes: 

X xY 

' A 

TTl / I \ 7r2 



^ y, such that for any other span 
Z ^ X X Y such that the diagram 




Remark 4.5.1.9. Definition 4.5.1.8 endows the product of two objects with something 
known as a universal property. It says that a product of two objects X and Y maps to 
those two objects, and serves as a gateway for all who do the same. "None shall map 
to X and Y except through me!" This grandiose property is held by products in all the 
various categories we have discussed so far. It is what I meant when I said things like 



-•^^The result is not necessarily inspiring, but at least computing it is straightforward. 

^^The names X xY and 7ri,7r2 are not mathematically important, they are pedagogically suggestive. 
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"X X Y maps to both X and Y and does so as straightforwardly as possible". The grid of 
dots obtained as the product of two sets has such a property, as was shown in Example 
2.4.1.11. 

Example 4.5.1.10. In Example 4.5.1.2 we discussed products of preorders. In this example 
we will discuss products in an individual preorder. That is, by Proposition 4.2.1.17, there 
is a functor PrO Cat that realizes every preorder as a category. If V = (P, ^) is 
a preorder, what are products in V7 Given two objects a, 6 g Oh{V) we first consider 
spans on a and 6, i.e. a ^ z ^ b. That would be some z such that z ^ a and z ^ b. The 
product will be such a span a ^ a x b ^ b, but such that every other spanning object z 
is less than or equal to a x 6. In other words a x 6 is as big as possible subject to the 
condition of being less than a and less than b. This is precisely the meet of a and b (see 
Definition 3.4.2.1). 

Example 4.5.1.11. Note that the product of two objects in a category C may not exist. 
Let's return to preorders to see this phenomenon. 

Consider the set R^, and say that (xi^yi) ^ (^2,^2) if there exists £ ^ 1 such that 
xi£ = X2 and yi£ = y2] in other words, point p is less than point q if, in order to travel 
from q to the origin along a straight line, one must pass through p along the way. 
We have given a perfectly good partial order, but p := (1,0) and q := (0,1) do not 
have a product. Indeed, it would have to be a non-zero point that was on the same 
line-through-the origin as p and the same line-through-the-origin as g', of which there are 
none. 

Example 4.5.1.12. Note that there can be more than one product of two objects in a 
category C, but that any two choices will be canonically isomorphic. Let's return once 
more to preorders to see this phenomenon. 

Consider the set M? and say that {xi^yi) ^ (^2,^2) if ^1 -\- yf ^ + ^2^ i^ other 
words if the former is on a smaller 0-circle (by which I mean "circle centered at the 
origin") than the latter is. 

For any two points p, q there will be lots of points that serve as products: anything 
on the smaller of their two 0-circles will suffice. Given any two points p, q on this smaller 
circle, we will have a unique isomorphism p = q because p ^ q and q ^ p and all 
morphisms are unique in a preorder. 

Exercise 4.5.1.13. Consider the preorder V of cards in a deck, shown in Example 3.4.1.3; 
it is not the entire story of cards in a deck, but take it to be so. In other words, be like 
a computer and take what's there at face value. Consider the preorder P as a category 
(by way of the functor PrO Cat). 

a. ) For each of the following pairs, what is their product in V (if it exists)? 

■^a diamond^ x "^a heart^ ? "^a queen^ x "^a black card^ ? 

"^a card^ x "^a red card^ ? "^a face card^ x "^a black card^ ? 

b. ) How would these answers differ if V was completed to the "whole story" partial order 

classifying cards in a deck? 



Exercise 4.5.1.14. Let X be a set, and consider it as a discrete category. Given two 
objects x^y E Ob(X), under what conditions will there exist a product x x y7 



^■"^Note that (0,0) is not related to anything else. 
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Exercise 4.5.1.15. Let / : R ^ R be a function, like you would see in 6th grade (maybe 
f{x) = X -\-7). A typical thing to do is to graph / as a curve running through the plane 
R^ := R X R. This curve can be understood as a function F: R — > R^. 

a. ) Given some x g R, what are the coordinates of F{x) eM?7 

b. ) Obtain F: R ^ R^ using the universal property given in Definition 4.5.1.8. 



Exercise 4.5.1.16. Consider the preorder (M, divides), discussed in Exercise 4.5.1.4, 
where e.g. 5 ^ 15 but 5^6. 

a. ) What is the product of 9 and 12 in this category? 

b. ) Is there a standard name for products in this category? 



Example 4.5.1.17. All products exist in the category Cat. Given two categories C and 
P, there is a product category C x V. We have Ob(C x V) = Ob(C) x Oh{V) and for 
any two objects (c, and {c'^d')^ we have 

Homcxi^CCc, (i), (c^, (i^) ^ Homc(c, c') x Homc((i, (i^). 

The composition formula is "obvious". 

Let [1] G Ob(Cat) denote the linear order category of length 1, drawn 



[1]: = 



As a schema it has one arrow, but as a category it has three morphisms. So we expect 
[1] X [1] to have 9 morphisms, and that's true. In fact, [1] x [1] looks like a commutative 
square: 



(0,0) idox/ (0,1) 
• ^ • 



/xido 

(1,0) 



/xidi 

Y 

(1,1) 



idi x/ 



We see only four morphisms here, but there are also four identities and one morphism 
(0,0) (1,1) given by composition of either direction. It is a minor miracle that the 
categorical product somehow "knows" that this square should commute; however, this is 
not the mere preference of man but instead the dictate of God! By which I mean, this 
follows rigorously from the definitions we already gave of Cat and products. 



4.5.1.18 Coproducts 

The coproduct of two sets is their disjoint union, which includes non-overlapping copies 
of each of the two sets. This is good intuition for coproducts in general. 

Example 4.5.1.19. Given two preorders, A'l := (Xi, ^i) and ^2 -= (-^2, ^2), we can take 
their coproduct and get a new preorder A'l u X2. Both A'l and have underlying sets 
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(namely Xi and X2), so we might hope that the underlying set of A'l x is the set 
Xi u X2 consisting of their disjoint union, and that turns out to be true. We have a 
notion of less-than on A'l and we have a notion of less-than on A2; given an element 
X E Xi u X2 and an element x' e Xi u X2, when should we say that x ^iu2 holds? 
The obvious guess is to say that it holds iff somebody says it does; that is, either both 
X and x' are from the same home (i.e. x e Xi and x' g Xi OR x e X2 and x' e X2) or 
they are from different homes (i.e. x e Xi and x^ e X2 OR x e X2 and x' e Xi), and 
this is correct: 

X1UX2 := (Xi uX2,^iu2) 

Note that the inclusion functions Xi Xi 1AX2 and X2 Xi 1AX2 induce morphisms 
of preorders. That is, if x^x' e Xi are elements such that x ^1 xMn A'l then the same 
will hold in A'l u A2. So we have preorder morphisms 



A'l u X2 




Xi X2 

Exercise 4.5.1.20. Suppose that you have a partial order A := (A, on apples (so 
you know some apples are preferable to others but sometimes you can't compare). And 
suppose you have a partial order O := (O, ^o) 011 oranges. You're about to be given 
two pieces of fruit from a basket of apples and oranges. Is the coproduct partial order 
Au O di reasonable guess for your preferences, or does it seem biased? 

Example 4.5.1.21. Given two graphs Gi = (Vi, Ai, srci, t^ti) and G2 = (^2, ^2, '5rc2, ^^^2), 
we can take their coproduct and get a new graph Gi u G2. The vertices will be the dis- 
joint union of vertices Vi uV2^ so each vertex in Gi x G2 is labeled either by a vertex 
in Gi or by one in G2 (and if any labels are shared, then something must be done to 
differentiate them). When should an arrow connect v to Whenever both are from the 
same component (i.e. either v' e Vi or v' EV2) and we can find an arrow connecting 
them in that component. It turns out there is a simple formula for the set of arrows in 
Gi u G2, namely Ai u A2. 

Let's write G := Gi u G2 and say G = (F, A, src, tgt). We now know that V = V1UV2 
and A = Ai u A2. What should the source and target functions A ^ V be? Given 
a function srci : Ai Vi and a function 5rc2 : A2 V2, the universal property of 
coproducts in Set (Lemma 2.4.2.7) provides a unique function 

src := srci u src2 : Ai u A2 ^ Vi u V2. 

Namely for any arrow a G A, we know either a G Ai or a G ^2 (and not both), so the 
source of a will be the vertex srci{a) if a g Ai and src2{a) if a g A2. Similarly we 
have a ready-made choice of target function tgt = tgti u tgt2. We have now defined the 
coproduct graph. 
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/ : = 




Arrow (/) 


ID 


src 


tgt 


/ 


V 


w 


9 


w 


X 


h 


w 


X 



Vertex (/) 



ID 



J: = 




Arrow (J) 


ID 


src 


tgt 


i 


Q 


r 


3 


r 


s 


k 


s 


r 


i 


s 


t 



Vertex (J) 



ID 



The coproduct I u J drawn below has, as expected 3 + 5 
arrows: 



vertices and 3 + 4 



luJ:-- 




Arrow (/ u J) 


ID 


src 


tgt 


/ 


V 


w 


9 


w 


X 


h 


w 


X 


i 


Q 


r 


3 


r 


s 


k 


s 


r 


i 


s 


t 



Vertex (/ u J) 



ID 



w 

X 



Here is the most important thing to notice. Look at the Arrow table / and notice 
that there is a way to send each row to a row in / u J, such that all the foreign keys 
match. Similarly in the arrow table and the two vertex tables for J. These "matchings" 
are readily- visible graph homomorphisms I ^ I u J and J ^ / u J in Grph. 

Exercise 4.5.1.22. Recall from Example 3.5.2.9 that a discrete dynamical system (DBS) 
is a set s together with a function /: s ^ s; if 



Coop : = 




is the loop schema, then a DBS is simply an instance (a functor) /: Coop Set. We 
have not yet discussed coproducts of BBS's, but perhaps you can guess how they should 
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work. For example, consider the instances /, J: Coop Set tabulated below: 



s (I) 


ID 


f 


A 


c 


B 


c 


C 


c 



s (J) 


ID 


f 


X 


y 


y 


X 


z 


z 



Make a guess and tabulate I u J. Then draw it. 

In every case above (preorders, graphs, DDSs), what's most important to recognize 
is that there are inclusion maps I ^ I u J and J ^ / u J, and that the construction of 
I u J seems as straightforward as possible, subject to having these inclusions. It is time 
to give the definition. 

Definition 4.5.1.23. Let C be a category and let X^Y e Ob(C) be objects. A cospan 
on X and Y consists of three constituents (Z^i^j), where Z e Ob(C) is an object, and 
where i : X ^ Z and j : Y ^ Z are morphisms in C. 



X 



A coproduct of X and F is a cospan X-^XuF^F, such that for any other 



cospan X ^ Z <^ Y there exists a unique morphism Sij : 
diagram below commutes: 

X uY 



X uY 



Z such that the 




Remark 4.5.1.24. Definition 4.5.1.8 endows the coproduct of two objects with a universal 
property. It says that a coproduct of two objects X and Y receives maps from those two 
objects, and serves as a gateway for all who do the same. "None shall receive maps from 
X and Y except through me!" This grandiose property is held by all the coproducts we 
have discussed so far. It is what I meant when I said things like "X u F receives maps 
from both X and F and does so as straightforwardly as possible". The disjoint union 
of dots obtained as the coproduct of two sets has such a property, as can be seen by 
thinking about Example 2.4.2.5. 

Example 4.5.1.25. By Proposition 4.2.1.17, there is a functor PrO Cat that realizes 
every preorder as a category. If 7^ = (P, ^) is a preorder, what are coproducts in V7 
Given two objects a, 6 e Oh{V) we first consider cospans on a and 6, i.e. a ^ z ^ b. 



The names X uY and li, L2 are not mathematically important, they are pedagogically suggestive. 
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A cospan of a and b is any z such that a ^ z and b ^ z. The coproduct wih be such a 
cospan a ^ a u 6 ^ 6, but such that every other cospanning object z is greater than or 
equal to a u 6. In other words a u 6 is as smah as possible subject to the condition of 
being bigger than a and bigger than b. This is precisely the join of a and b (see Definition 
3.4.2.1). 

Just as for products, the coproduct of two objects in a category C may not exist, or it 
may not be unique. The non-uniqueness is much less "bad" because given two candidate 
coproducts, they will be canonically isomorphic. They may not be equal, but they are 
isomorphic. But coproducts might not exist at all in certain categories. We will explore 
that a bit below. 

Example 4.5.1.26. Consider the set and partial order from Example 4.5.1.11 where 
{xi^yi) ^ (^2,^2) if there exists i ^ 1 such that xii = X2 and yii = y2. Again the 
points p := (1,0) and q := (0,1) do not have a coproduct. Indeed, it would have to 
be a non-zero point that was on the same line-through-the origin as p and the same 
line-through-t he-origin as (7, of which there are none. 

Exercise 4.5.1.27. Consider the preorder V of cards in a deck, shown in Example 3.4.1.3; 
it is not the entire story of cards in a deck, but take it to be so. In other words, be like a 
computer and take what's there at face value. Consider the preorder 7^ as a category (by 
way of the functor PrO Cat). For each of the following pairs, what is their coproduct 
in V (if it exists)? 

■^a diamond^ u "^a heart^ ? "^a queen^ u "^a black card^ ? 

■^a card^ u "^a red card^ ? "^a face card^ u "^a black card^ ? 

How would these answers differ if V was completed to the "whole story" partial order 
classifying cards in a deck? 

Exercise 4.5.1.28. Let X be a set, and consider it as a discrete category. Given two 
objects x^y E Ob(X), under what conditions will there exist a coproduct x u y? 

Exercise 4.5.1.29. Consider the preorder (M, divides), discussed in Exercise 4.5.1.4, 
where e.g. 5 ^ 15 but 5^6. 

a. ) What is the coproduct of 9 and 12 in that category? 

b. ) Is there a standard name for coproducts in that category? 



4.5.2 Diagrams in a category 

We have been drawing diagrams since the beginning of the book. What is it that we 
have been drawing pictures of? The answer is that we have been drawing functors. 

Definition 4.5.2.1. Let C and / be categories. '^^ An I-shaped diagram in C is simply 
a functor d: I ^ C. In this case / is called the indexing category for the diagram. 



^^In fact, the indexing category / is usually assumed to be small in the sense of Remark 4.1.1.2, 
meaning that its collection of objects is a set. 
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Example 4.5.2.2. Consider the commutative diagram in Set drawn below: 

-1 




(4.17) 



This is the drawing of a functor d: [1] x [1] Set. With notation for the objects and 
morphisms of [1] x [1] as shown in Example 4.5.1.17, we have (i(0, 0) = (i(0, 1) = (i(l, 0) = 
M and <i(l, 1) = Z (for some reason..) and (i(ido, /) : M ^ M given by n n + 1, etc. 

The fact that is a functor means it must respect composition formulas, which implies 
that this diagram commutes. Recall from Section 2.2 that not all diagrams one can draw 
will commute; one must specify that a given diagram commutes if he or she wishes to 
communicate this fact. But then how is a non-commuting diagram to be understood as 
a functor? 

Let G E Ob(Grph) denote the following graph 



(0,0) 



(1,0) 



(0,1) 



Y 

(1,1) 



Recall the free category functor F: Grph Cat from Example 4.1.2.27. The free 
category F{G) e Ob(Cat) on G looks almost like [1] x [1] except that since [/, ^] is a 
different path in G than is they become different morphisms in F{G). A functor 

F{G) Set might be drawn the same way that (4.17) is, but it would a diagram that 
would not be said to commute. 

We call [1] X [1] the commutative square indexing category. 

Exercise 4.5.2.3. Consider [2], the linear order category of length 2. 

a. ) Is [2] the appropriate indexing category for commutative triangles. 

b. ) If not, what is? 



Example 4.5.2.4. Recall that an equalizer in Set was a diagram of sets that looked like 
this: 



/ 



91 



(4.18) 



where gio f = g2^ f - What is the indexing category for such a diagram? It is the schema 
(4.18) with the PED [/, ^i] ^ [/, ^2]- That is, in some sense you're seeing the indexing 
category, but the PED needs to be declared. 

Exercise 4.5.2.5. Let C be a category, A e Ob(C) an object, and /: A ^ A morphism 
in C Consider the two diagrams in C drawn below: 



A f A f A f 

• ^ • ^ • ^ • • 



/ 



^We might call what is here denoted by F{G) the noncommutative square indexing category. 
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a. ) Should these two diagrams have the same indexing category? 

b. ) If they should have the same indexing category, what is causing or allowing the 

pictures to appear different? 

c. ) If they should not have the same indexing category, what coincidence makes the two 

pictures have so much in common? 







Definition 4.5.2.6. Let / g Ob(Cat) be a category. The left cone on /, denoted is 
the category defined as follows. On objects we put Ob(/^) = {— oo} u Ob(/), and we call 
the new object — oo the cone point of I^. On morphisms we add a single new morphism 
: — 00 ^ 6 for every object b E Ob(/); more precisely. 



Hom/^ (a, b) 



^Hom/(a, b) 
{id-00} 



if a, 6 E Ob(/) 

if a = -00, b e Ob(/) 

if a = b = —00 

if aG0b(/),6= -00. 



The composition formula is in some sense obvious. To compose two morphisms both in 
/, compose as dictated by /; if one has —00 as source then there will be a unique choice 
of composite. 

There is an obvious inclusion of categories I ^ I^. 

Remark 4.5.2.7. Note that the specification of given in Definition 4.5.2.6 works just 
as well if / is considered a schema and we are constructing a schema add the new 
object —00 and the new arrows 55 : — 00 ^ 6 for each b e Ob(/), and for every morphism 
f:b^b'm.I add a PED [55/] ^ [55,/]. We generally will not distinguish between 
categories and schemas, since they are equivalent. 

Example 4.5.2.8. For a natural number n g M, we define the n -leaf star schema., denoted 
Star^, to be the category (or schema, see Remark 4.5.2.7) n^, where n is the discrete 
category on n objects. Below we draw Staro, Stari, Star2, and Stars. 




Exercise 4.5.2.9. Let Co := denote the empty category and for any natural number 
n 6 N, let Cn+i = (Cn)^. Draw C4. 

Exercise 4.5.2.10. Let C be the graph indexing schema as in (4.7). What is and how 
does it compare to (4.18)? 

Definition 4.5.2.11. Let / g Ob(Cat) be a category. The right cone on /, denoted 
is the category defined as follows. On objects we put Ob(/^) = Ob(/) u {00}, and we call 
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the new object oo the cone point of I^. On morphisms we add a single new morphism 
4 : 6 ^ 00 for every object b e Ob(/); more precisely, 



Hom/^(a, b) 



^ Horn/ (a, 6) if a, 6 g Ob(/) 

{h} ifaeOh{I),b = 00 

{idoo} if a = 6 = 00 

[0 if a = oo,6g Ob(/). 



The composition formula is in some sense obvious. To compose two morphisms both in 
/, compose as dictated by /; if one has oo as target then there will be a unique choice of 
composite. 

There is an obvious inclusion of categories I ^ I^. 

Exercise 4.5.2.12. Let C be the category (2^)^, where 2 is the discrete category on two 
objects. Then C is somehow square-shaped, but what category is it exactly? Looking 
at Example 4.5.2.2, is C the commutative diagram indexing category [1] x [1], is it the 
non- commutative diagram indexing category F{G)^ or is it something else? 



4.5.3 Limits and colimits in a category 

Let C be a category, / be an indexing category (which just means a category that we're 
about to use as the indexing category for a diagram) , and D: / ^ C an /-shaped diagram 
(which just means a functor). It is in relation to this setup that we can discuss the limit 
or colimit. In general the limit of a diagram D: I ^ C will be a shaped diagram 
limD: ^ C. In the case of products 1 = 2 and = Star2 looks like a span (see 
Example 4.5.2.8). But out of all the /^-shaped diagrams, which is the product of D? 
Answer: the one with the universal "gateway" property, see Remark 4.5.1.9. 



4.5.3.1 Universal objects 

Definition 4.5.3.2. Let C be a category. An object a e Ob(C) is called initial if, for 
all objects c e Ob(C) there exists a unique morphism a ^ c, i.e. |Homc(a,c)| = 1. An 
object z E Ob(C) is called terminal if, for all objects c e Ob(C) there is exists a unique 
morphism c^ i.e. |IIomc(c, 2:)| = 1. 

An object in a category is called universal if it is either initial or terminal, but we 
rarely use that term in practice, preferring to be specific about whether the object is 
initial or terminal. The word final is synonymous with the word terminal, but we'll try 
to constantly use terminal. 

Colimits will end up being initial things of a certain sort, and limits will end up being 
terminal things of a certain sort. But we will get to that in Section 4.5.3.14. 

Example 4.5.3.3. The initial object in Set is the set a for which there is always one way 
to map from a to anything else. Given c e Ob(Set) there is exactly one function c, 
because there are no choices to be made, so the empty set is the initial object in Set. 

The terminal object in Set is the set z for which there is always one way to map to 
z from anything else. Given c E Ob (Set) there is exactly one function c {©}, where 
{©} is any set with one element, because there are no choices to be made: everything in 
c must be sent to the single element in {©}. There are lots of terminal objects in Set, 
and they are all isomorphic to 1. 
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Example 4.5.3.4. The initial object in Grph is the graph a for which there is always one 
way to map from a to anything else. Given c e Ob (Grph), there is exactly one function 
^ c, where e Grph is the empty graph; so is the initial object. 

The terminal object in Grph is more interesting. It is Coop^ the graph with one 
vertex and one arrow. In fact there are infinitely many terminal objects in Grph, but 
all of them are isomorphic to Coop. 

Proposition 4.5.3.5. Let C be a category and let ai,a2 e Ob(C) both be initial objects. 
Then there is a unique isomorphism ai ^02. (Similarly, for any two terminal objects 
in C there is a unique isomorphism between them.) 

Proof. Suppose ai and a2 are initial. Since ai is initial there is a unique morphism 
/: ai ^ a2; there is also a unique morphism ai ^ ai, which must be id^^. Since a2 is 
initial there is a unique morphism g: a2 ^ ai; there is also a unique morphism a2 ^ ^2, 
which must be ida2 • So ^ o / = id^^ and f o g = ida2 5 which means that / is the desired 
(unique) isomorphism. 

The proof for terminal objects is appropriately "dual". 

□ 

Exercise 4.5.3.6. Let X be a set, let P(X) be the set of subsets of X (see Definition 
2.7.4.9). We can regard P(X) as a preorder under inclusion of subsets (see for example 
Section 3.4.2). And we can regard preorders as categories using a functor PrO Cat 
(see Proposition 4.2.1.17). 

a. ) What is the initial object in P(X)? 

b. ) What is the terminal object in P(X)? 



Example 4.5.3.7. The initial object in the category Mon of monoids is the trivial monoid, 
1. For any monoid M, a morphism of monoids 1 ^ M is a functor between 1-object 
categories and these are determined by where they send morphisms. Since 1 has only 
the identity morphism and functors must preserve identities, there is no choice involved 
in finding a monoid morphism l^M. 

Similarly, the terminal object in Mon is also the trivial monoid, 1. For any monoid 
M, a morphism of monoids M ^ 1 sends everything to the identity; there is no choice. 

Exercise 4.5.3.8. a.) What is the initial object in Grp, the category of groups? 
b.) What is the terminal object in Grp? 



Example 4.5.3.9. Recall the preorder Prop of logical propositions from Section 4.2.4.1. 
The initial object is a proposition that implies all others. It turns out that "FALSE" is 
such a proposition. The proposition "FALSE" is like "1 ^ 1"; in logical formalism it can 
be shown that if "FALSE" is true then everything is true. 

The terminal object in Prop is a proposition that is implied by all others. It turns 
out that "TRUE" is such a proposition. In logical formalism, everything implies that 
"TRUE" is true. 

Example 4.5.3.10. The discrete category 2 has no initial object and no terminal object. 
The reason is that it has two objects 1,2, but no maps from one to the other, so when 
Hom2(l,2) = Hom2(2,l) = 0. 

Exercise 4.5.3.11. Recall the divides preorder from Exercise 4.5.1.4, where 5 divides 15. 
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a. ) Considering this preorder as a category, does it have an initial object? 

b. ) Does it have a terminal object? 



Exercise 4.5.3.12. Let M = (List({a, 6}), [ ],^) denote the free monoid on {a, 6} (see 
Definition 3.1.1.12). 

a. ) Does it have an initial object? 

b. ) Does it have a terminal object? 



Exercise 4.5.3.13. Let 6* be a set and consider the indiscrete category Ks e Ob(Cat) on 
S objects (see Example 4.3.4.3). 

a. ) Does Ks have an initial object? 

b. ) Does it have a terminal object? 



4.5.3.14 Examples of limits 

Let C be a category and let X, F g Ob(C) be objects. Definition 4.5.1.8 defines a product 
of X and F to be a span X <^ X xY Y such that for every other span X <^ Z Y 
there exists a unique morphism Z ^ X xY making the triangles commute. It turns out 
that we can enunciate this in our newly formed language of universal objects by saying 
that the span X <^ X x Y Y is itself a terminal object in the category of spans 
on X and Y. Phrasing the definition of products in this way will be generalizable to 
defining arbitrary limits. 

Construction 4.5.3.15 (Products). Let C be a category and let Xi,X2 be objects. We 
can consider this setup as a diagram X: 2 ^ C, where X(l) = Xi and X{2) = X2. 
Consider the category 2^ = Star2, which is drawn in Example 4.5.2.8, and the category 
of functors Fun(2^,C). The objects in Fun(2^,C) are spans in C and the morphisms 
are natural transformations between them. Given a functor S : 2^ C we can compose 
with i : 2 ^ 2^ to get a functor 2 ^ C. We want that to be X. 




So we are ready to define the category of spans on Xi and X2. 

Define the category of spans on X, denoted C/x? to be the category whose objects 
and morphisms are as follows: 

Oh{C/x) = {8:2^ ^C\Soi = X} (4.19) 
Homc/^ {S, S') = {a:S^S'\aoi = idx}. 

The product of Xi and X2 is the terminal object in C/x- 
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To bring this down to earth, an object in C/x looks is any diagram in C that looks 
like this: 

Z 




Xi X2 

In other words, the objects of C/x spans, each of which we might write in-line as 

Xi^ Z ^ X2. A morphism in C/x from object Xi ^ Z ^ X2 to object X^ ^ Z' ^ 
X2 consists of a morphism £: Z ^ Z' ^ such that p' o i = p and o £ = q. So morphisms 
in C/x look like diagrams as to the right: 



Z 




I Z' ) 

(4.20) 

Each object in C/x is a span on Xi and X2, and each morphism in C/x is a "morphism 
of cone points in C making everything in sight commute". The terminal object in C/x is 
the product of Xi and X2] see Definition 4.5.1.8. 

It may be strange to have a category in which the objects are spans in another 
category. But once you admit this possibility, the notion of morphism between spans 
is totally sensible. Or if it isn't, then stare at (4.20) for 30 seconds and say to yourself 
"When in Rome..!" These are the aqueducts of category theory, and they work wonders. 

Example 4.5.3.16. Consider the 6-object category C drawn below, in which the three 

^^To be completely pedantic, according to (4.19), the morphisms in C/x should be drawn like this: 



z 




Z' 



But this is going a bit overboard. 
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diagrams that can commute do: 









^ • 
























A ^ 


/ 


B / 






• — 








• ^ • 



Let X: 2 ^ C be given by X(l) = X\ and X(2) = X2. Then the category of spans on 
X might be drawn 





(^,61,62) 


(C,Ci,C2) 


9 (D^diM 




• 


• 


^ • 



4.5.3.17 Definition of limit 

Definition 4.5.3.18. Let C be a category, let / be a category; let be the left cone on 
/, and let z : / ^ be the inclusion. Suppose that X : / ^ C is an /-shaped diagram 
in C. The slice category of C over X denoted C/x is the category whose objects and 
morphisms are as follows: 

Oh{C/x) = {S: ^C \ Soi = X} 
Homc/^ {S, S') = {a:S^S'\aoi = idx}. 

A limit of X^ denoted \imj X or limX, is a terminal object in C^x- 

Fullbacks The relevant indexing category for pullbacks is the cospan, 1 = 2^ drawn 
as to the left below: 



I X:I^C 




26 



A /-shaped diagram in C is a functor X: / ^ C, which we might draw as to the right 
above (e.g. Xq e Ob(C)). 



^^We use a dash box here because we're not drawing the whole category but is merely a diagram 
existing inside C. 
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An object S in the slice category C/x is a commutative diagram S: I^ C over X, 
which looks like the box to the left below: 



SeOh{C,x) 





A morphism in Cjx is drawn in the dashbox to the right above. A terminal object in 
C/x is precisely the "gateway" we want, i.e. the limit of X is the pullback Xq -^i- 

Exercise 4.5.3.19. Let / be the graph indexing category (see 4.7). 

a. ) What is /^? 

b. ) Now let G: / ^ Set be the graph from Example 3.3.1.2. Give an example of an 

object in SetjQ. 

c. ) We have already given a name to the limit of G: I ^ Set; what is it? 



Exercise 4.5.3.20. Let C be a category and let / 
unique functor X : ^ C. 

a. ) What is the slice category C/x^ 

b. ) What is the limit of XI 



be the empty category. There is a 



Example 4.5.3.21. Often one wants to take the limit of some strange diagram. We have 
now constructed the limit for any shape diagram. For example, if we want to take the 
product of more than two, say n, objects, we could use the diagram shape I = n whose 
cone is Star^ from Example 4.5.2.8. 

Example 4.5.3.22. We have now defined limits in any category, so we have defined limits 
in Cat. Let [1] denote the category depicted 



and let C be a category. Naming two categories is the same thing as naming a functor 
X: 2 ^ Cat, so we now have such a functor. Its limit is denoted [1] x C. It turns 
out that [1] X C looks like a "C-shaped prism". It consists of two panes, front and back 
say, each having the precise shape as C (same objects, same arrows, same composition), 
and morphisms from the front pane to the back pane making all front-to-back squares 
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commute. For example, if C looked was the category generated by the schema to the left 
below, then C x [1] would be the category generated by the schema to the right below: 




It turns out that a natural transformation a: F ^ G between functors F^G\ C ^ V 
is the same thing as a functor C x [1] V such that the front pane is sent via F 
and the back pane is sent via G. The components are captured by the front-to-back 
morphisms, and the naturality is captured by the commutativity of the front-to-back 
squares in C x [1]. 

Remark 4.5.3.23. Recall in Section 2.7.6.6 we described relative sets. In fact. Definition 
2.7.6.7 basically defines a category of relative sets over any fixed set B. Let 1 denote the 
discrete category on one object, and note that providing a functor 1 Set is the same 
as simply providing a set, so consider B: 1^ Set. Then the slice category Set/^, as 
defined in Definition 4.5.3.18 is precisely the category of relative sets over B: it has the 
same objects and morphisms as was described in Definition 2.7.6.7. 



4.5.3.24 Definition of colimit 

The definition of colimits is appropriately "dual" to the definition of limits. Instead 
of looking at left cones, we look at right cones; instead of being interested in terminal 
objects, we are interested in initial objects. 

Definition 4.5.3.25. Let C be a category, let / be a category; let be the right cone 
on /, and let z : I ^ be the inclusion. Suppose that X : / ^ C is an /-shaped diagram 
in C. The coslice category of C over X denoted Cx/ is the category whose objects and 
morphisms are as follows: 

Oh{Cx/) = {S: I"^ ^C\Soi = X} 
Homc^/ (S^S^) = {a: S ^ \ aoi = idx } • 



A colimit of X^ denoted colim/ X or colimX, is an initial object in Cx/- 
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Pushouts The relevant indexing category for pushouts is the span, 1 = 2^ drawn as 
to the left below: 



X: I 





An /-shaped diagram in C is a functor X : I ^ which we might draw as to the right 
above (e.g. Xq e Ob(C)). 

An object S in the coslice category Cx/ is a commutative diagram S : I^ ^ C over 
X, which looks like the box to the left below: 



f--s 



S e Ob(Cx/) 



^0 ■ 



' X2 




A morphism in Cx/ is drawn in the dashbox to the right above. An initial object in C/x 
is precisely the "gateway" we want; i.e. the colimit of X is the pushout, Xi uxq X2. 

Exercise 4.5.3.26. Let / be the graph indexing category (see 4.7). 

a. ) What is 

b. ) Now let G: / ^ Set be the graph from Example 3.3.1.2. Give an example of an 

object in SetQ/. 

c. ) We have already given a name to the colimit of G: I ^ Set; what is it? 



Exercise 4.5.3.27. Let C be a category and let / = be the empty category. There is a 
unique functor X : ^ C. 

a. ) What is the coslice category C^/? 

b. ) What is the colimit of X? 



170 



CHAPTER 4. BASIC CATEGORY THEORY 



Example 4.5.3.28 (Cone as colimit). We have now defined colimits in any category, so we 
have defined colimits in Cat. Let C be a category and recall from Example 4.5.3.22 the 
category C x [1] . The inclusion of the front pane is a functor zq: C ^ C x [1] (similarly, 
the inclusion of the back pane is a functor ii : C ^ C x [1]) . Finally let t: C ^ 1 be the 
unique functor to the terminal category (see Exercise 4.1.2.34). We now have a diagram 
in Cat of the form 

C X [1] 

t 

Y 
1 

The colimit (i.e. the pushout) of this diagram in Cat slurps down the entire front pane 
of C X [1] to a point, and the resulting category is isomorphic to C^. Here is a drawing 
of this phenomenon 




t 




C-^(Cx[l])ucl 



(Similarly, the pushout of the analogous diagram for ii would give C^.) 

Example 4.5.3.29. Consider the category Top of topological spaces. The (empty) circle 
is a topological space which people often denote (for "1-dimensional sphere"). The 
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filled-in circle, also called a 2-dimensional disk, is denoted D^. The inclusion of the circle 
into the disk is continuous so we have a morphism in Top of the form i: ^ . The 
terminal object in Top is the one-point space {©}, and so there is a unique morphism 

t\ {©}. The pushout of the diagram ^ {©} is isomorphic to the 

2-dimensional sphere (the exterior of a tennis ball), S'^. The reason is that we have 
slurped the entire bounding circle to a point, and the category of topological spaces has 
the right morphisms to ensure that the resulting space really is a sphere. 

Application 4.5.3.30. Consider the symmetric graph Gn consisting of a chain of n vertices. 



Think of this as modeling a subway line. There are n-many graph homomorphisms 
Gi Gn given by the various vertices. One can create transit maps using colimits. For 
example, the colimit of the diagram to the left is the symmetric graph drawn to the right 
below. 



colim 




can be drawn 



•5 



4 

•4 ■ 



•3 

I 

3 



6 



5 
■ •2 



00 



4.6 Other notions in Cat 

In this section we discuss some leftover notions about categories. For example in Section 
4.6.1 we explain a kind of duality for categories, in which arrows are flipped. For example 
reversing the order in a preorder is an example of this duality, as is the similarity between 
limits and colimits. In Section 4.6.2 we discuss the so-called Grothendieck construction 
which in some sense graphs functors, and we show that it is useful for transforming 
databases into the kind of format (RDF) used in scraping data off webpages. We define 
a general construction for creating categories in Section 4.6.4. Finally, in Section 4.6.5 
we show that precisely the same arithmetic statements that held for sets in Section 2.7.3 
hold for categories. 

4.6.1 Opposite categories 

People used to discuss two different kinds of functors between categories: the so-called 
covariant functors and the so-called contravariant functors. Covariant functors are what 
we have been calling functors. The reader may have come across the idea of contravari- 
ance when considering Exercise 4.2.3.2.^^ There we saw that a continuous mapping of 



Similarly, see Exercise 4.2.4.4). 
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topological spaces f : X ^ Y does not induce a morphism of orders on their open sets 
Op(X) Op(y'); that is not required by the notion of continuity. Instead, a morphism 
of topological spaces f : X ^ Y induces a morphism of orders Op(F) Op(X), going 
backwards. So we do not have a functor Top PrO in this way, but it's quite close. 
One used to say that Op is a contravariant functor Top PrO. 

As important and common as contravariance is, people found that keeping track of 
which functors were covariant and which were contravariant was a big hassle. Luck- 
ily, there is a simple work-around, which simplifies everything: the notion of opposite 
categories. 

Definition 4.6.1.1. Let C be a category. The opposite category of C, denoted C°p, has 
the same objects as C, i.e. Ob(C°P) = Ob(C), and for any two objects c, c', one defines 

Hom^op (c,c') := Hom^ (c',c). 

Example 4.6.1.2. If n e M is a natural number and n the corresponding discrete category, 
then = n. Recall the span category 1 = 2^ from Definition 4.5.1.8. Its opposite is 
the cospan category = 2^, from Definition 4.5.1.23. 

Exercise 4.6.1.3. Let C be the category from Example 4.5.3.16. Draw C°p. 

Lemma 4.6.1.4. Let C and V he categories. One has (C°p)°p = C. Also we have 
Fun(C,P) ^ Fun(C°P,P°P). This implies that a functor C°p V can he identified with 
a functor C P°p. 

Proof. This follows straightforwardly from the definitions. 

□ 

Exercise 4.6.1.5. In Exercises 4.2.3.2, 4.2.4.3, and 4.2.4.4 there were questions about 
whether a certain function Ob(C) Oh{V) extended to a functor C ^ V. In each case, 
see if the proposed function would extend to a "contravariant functor" i.e. to a functor 
C°P ^V. 

Example 4.6.1.6 (Simplicial sets). Recall from Example 4.3.4.4 the category A of linear 
orders [n]. For example, [1] is the linear order 0^1 and [2] is the linear order ^ 1 ^ 2. 
Both [1] and [2] are objects of A. There are 6 morphisms from [1] to [2], which we 
could denote 

HomA([l], [2]) = {(0, 0), (0, 1), (0, 2), (1, 1), (1, 2), (2, 2)}. 

It may seem strange, but the category A°p turns out to be quite useful in algebraic 
topology. It is the indexing category for a combinatorial approach to the homotopy 
theory of spaces. That is, we can represent something like the category of spaces and 
continuous maps using the functor category sSet := Fun(A°P, Set), which is called the 
category of simplicial sets. 

This may seem very complicated compared to something we did earlier, namely sim- 
plicial complexes. But simplicial sets have excellent formal properties that simplicial 
complexes do not. We will not go further with this here, but through the work of Dan 
Kan, Andre Joyal, Jacob Lurie, and many others, simplicial sets have allowed category 
theory to pierce deeply into the realm of topology and vice versa. 
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4.6.2 Grothendieck construction 

Let C be a database schema (or category) and let /: C ^ Set be an instance. We have 
been drawing this in table form, but there is another standard way of laying out the data 
in /, called the resource descriptive framework or RDF. Developed for the web, RDF is a 
useful format when one does not have a schema in hand, e.g. when scraping information 
off of a website, one does not know what schema will be best. In these cases, information 
is stored in so-called RDF triples, which are of the form 

(Subject, Predicate, Object) 



For example, one might see something like 



Subject 


Predicate 


Object 


AOl 


occuredOn 


D13114 


AOl 


performedBy 


P44 


AOl 


actionDescription 


Told congress to raise debt ceiling 


D13114 


has Year 


2013 


D13114 


hasMonth 


January 


D13114 


hasDay 


14 


P44 


FirstName 


Barack 


P44 


LastName 


Obama 



Category-theoretically, it is quite simple to convert a database instance I: C ^ Set 
into an RDF triple store. To do so, we use the Grothendieck construction^ which is more 
aptly named the category of elements construction, defined below. 

Definition 4.6.2.1. Let C be a category and let / : C ^ Set be a functor. The category 
of elements of /, denoted /, is defined as follows: 

Ob(J/):= |cGOb(C),xG/(c)}. 
c 

Homj^KC^,^), {C',x')) := {f : C ^ C \ I{f){x) = x'}. 

There is a natural functor tt/ : / — > C. It sends each object (C, x) g Ob(5^ /) to 
the object C g Ob(C). And it sends each morphism /: (C, x) {C ^ x') to the morphism 
f-.C^C. 

Example 4.6.2.2. Let A be a set, and consider it as a discrete category. We saw in 
Exercise 4.3.3.3 that a functor S: A ^ Set is the same thing as an A-indexed set, as 
discussed in Section 2.7.6.10. We will follow Definition 2.7.6.12 and for each a e A write 
Sa := S{a). 

What is the category of elements of a functor S: A ^ Set? The objects of S are 
pairs (a, s) where a e A and s g S{a). Since A has nothing but identity morphisms, S 

Apparently, Alexander Grothendieck did not invent this construction, it was discussed prior to 
Grothendieck's use of it, e.g. by Mac Lane. But more to the point, the term Grothendieck construction is 
not grammatically suited in the sense that both the following are awkward in English: "the Grothendieck 
construction of / is ..." (awkward because / is not being constructed but used in a construction) and 
"the Grothendieck construct for / is..." (awkward because it just is). The term category of elements is 
more descriptive and easier to use grammatically. 
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has nothing but identity morphisms; i.e. it is the discrete category on a set. In fact that 
set is the disjoint union 




The functor tts- ^^S ^ A sends each element in Sa to the element a e A. 

One can see this as a kind of histogram. For example, let A = {BOS, NYC, LA, DC} and 
let : A ^ Set assign 



^Bos = {Abby, Bob, Casandra}, 

^NYC = 07 

Sla = {John, Jim}, and 
S'dc = {Abby, Car la}. 

Then the category of elements of S would look like the (discrete) category at the top: 



(BOS, Abby) 






• 






(BOS,Bob) 


(LA, John) 


(DC, Abby) 


• 


• 


• 


(BOS, Casandra) 


(LA, Jim) 


(DC,Carla) 


• 


• 


• 



(4.22) 



TVS 



A = 



BOS 


NYC 


LA 


DC 


• 


• 


• 


• 



We also see that the category of elements construction has converted an A-indexed 
set into a relative set over A, as in Definition 2.7.6.7. 

The above example does not show at all how the Grothendieck construction trans- 
forms a database instance into an RDF triple store. The reason is that our database 
schema was A, a discrete category that specifies no connections between data (it simply 
collects the data into bins). So lets examine a more interesting database schema and 
instance. This is taken from [Sp2]. 

Application 4.6.2.3. Consider the schema below, which we first encountered in Example 
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3.5.2.1: 



C :-- 



Employee manager worksin ^ Employee worksin 
Department secretary worksin ^ Department 



manager 




worksin 



Department 



secretary 



FirstNameString 



LastNameString 



Department NameSt ring 



(4.23) 



And consider the instance I: C ^ Set, which we first encountered in (3.11) and (3.13) 



Employee 


ID 


first 


last 


manager 


worksin 


101 


David 


Hilbert 


103 


qlO 


102 


Bertrand 


Russell 


102 


x02 


103 


Emmy 


Noether 


103 


qlO 



Department 


ID 


name 


secretary 


qlO 


Sales 


101 


x02 


Production 


102 



FirstNameString 



ID 



Alan 



Bertrand 



Carl 



David 



Emmy 



LastNameString 



ID 



Arden 



Hilbert 



Jones 



Noether 



Russell 



DepartmentNameString 



ID 



Marketing 



Production 



Sales 



The category of elements of /: C ^ Set looks like this: 
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manager 




worksin 



Department 



secretary 



FirstNameString LastNameString 



DepartmentNameString 



In the above drawing (4.24) of /, we left out 10 arrows for ease of readability, for 

102 first Bertrand 

example, we left out an arrow • > • 

For the punchline, how do we see the category of elements / as an RDF triple 
store? For each arrow in /, we take the triple consisting of the source vertex, the 
arrow name, and the target vertex. So our triple store would include triples such as 
(102 first Bertrand) and (101 manager 103>. 

00 

Exercise 4.6.2.4. Come up with a schema and instance whose category of elements con- 
tains (at least) the data from (4.21). 
Exercise 4.6.2.5. Recall from Section 3.1.2.10 that a finite state machine is a free monoid 
(List(S), [ ],^) acting on a set X. Recall also that we can consider a monoid as a 
category Ai with one object and a monoid action as a set- valued functor F: M. ^ Set, 
(see Section 4.2.1.1). In the case of Figure 3.1 the monoid in question is List(a, 6), which 
can be drawn as the schema 

a h 

I CO I 

and the functor F: A4 ^ Set is recorded in an action table in Example 3.1.3.1. What 
is 5^ F? How does it relate to the picture in Figure 3.1? 
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4.6.3 Full subcategory 

Definition 4.6.3.1. Let C be a category and let X ^ Ob(C) be a set of objects in C. 
The full subcategory of C spanned by X is the category, which we denote by Coh=Xi with 
objects Ob(Cob=x) X and with morphisms Hom^Q^^^ (x, x') := Homc(x,x'). 

Example 4.6.3.2. The following are examples of full subcategories. We will name them in 
the form "X inside of F", and each time we mean that X and Y are names of categories, 
the category X can be considered as a subcategory of the category Y in some sense, and 
it is full. In other words, all morphisms in Y "count" as morphisms in X. 

• Finite sets inside of sets. Fin ^ Set; 

• Finite sets of the form n inside of Fin; 

• Linear orders of the form [n] inside of all finite linear orders, A ^ FLin; 

• Groups inside of monoids, Grp ^ Mon; 

• Monoids inside of categories, Mon ^ Cat; 

• Sets inside of graphs. Set ^ Grph; 

• Partial orders (resp. linear orders) inside of PrO; 

• Discrete categories (resp. indiscrete categories) inside of Cat; 

Remark 4.6.3.3. A subcategory C ^ P is just a functor i: C ^ V that happens to be 
injective on objects and arrows. The subcategory is full if and only if z is a full functor 
in the sense of Definition 4.3.4.12. 

Example 4.6.3.4. Let C be a category, let X ^ Ob(C) be a set of objects, and let Cob=x 
denote the full subcategory of C spanned by X. We can realize this as a fiber product 
of categories. Indeed, recall that for any set, we can form the indiscrete category on 
that set; see Example 4.3.4.3. In fact, we have a functor Ind: Set Cat. Thus 
our function X Ob(C) can be converted into a functor between indiscrete categories 
Ind{X) Ind{Oh{C)). There is also a functor C Ind{Oh{C)) sending each object to 
itself. Then the full subcategory of C spanned by X is the fiber product of categories, 

Coh=x ^ C 

Y Y 
Ind{X) ^ Ind{Oh{C)) 

4.6.4 Comma categories 

Category theory includes a highly developed and interoperable catalogue of materials 
and production techniques. One such is the comma category. 

Definition 4.6.4.1. Let A,B, and C be categories and let F: A ^ C and G: B ^ C 
be functors. The comma category of C morphisms from F to G, denoted {F [c G) or 
simply (F I G), is the category with objects 

Ob(F i G) = {{a,bj) \aeOh{A),beOh{B)J: F{a) G{b) in C} 
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F{a) 
F{a') 



G{r) 

■G{h') 



Figure 4.25: A morphism {q, r) : (a, 6, /) — > (a^ 6^ f) in (F [ G) 



and for any two objects (a^b^f) and {a' ^h' ^ f) the set Hom^^ic') ((a, 6, /), (a', 6^, /')) of 
morphisms (a^b^f) — > {a\b\f') is 

{((7,r) I a^a' in^, r: 6^6' inS, such that f o F{q) =G{r)of}. 

Example 4.6.4.2. Let C be a category and I : C ^ Set a functor. In this example we 
show that the comma category construction captures the notion of taking the category 
of elements /; see Definition 4.6.2.1. 

Consider the set 1, the category Disc{l)^ and the functor F: Disc{l) Set sending 

F I 

the unique object to the set 1. We use the comma category setup 1 — "^ '^ 
There is an isomorphism of categories 



Set 



C. 



Indeed, an object in {F | /) is a triple (a^b^f) where a e Ob(l),6 e Ob(C), and 
/: F{a) I{b) is a morphism in Set. There is only one object in 1, so this reduces to 
a pair (6,/) where b e Ob(C) and /: {©} ^ I{b). The set of functions {©} ^ I{b) is 
isomorphic to as we saw in Exercise 2.1.2.13. So we have reduced Ob(F | /) to the 
set of pairs (6, x) where b e Ob(C) and x e I{b); this is Ob(J^ /). Because there is only 
one function 1 ^ 1, a morphism (6, x) (b'^x') in [F [ I) boils down to a morphism 
r : b ^ b' such that the diagram 

1 



1-/(60 

X 

But such diagrams are in one-to-one correspondence with the diagrams needed for mor- 
phisms in I. 

Exercise 4.6.4.3. Let C be a category and let c^d e Ob(C) be objects. Consider them 

c c 

as functors c, c : 1 ^ C, and consider the setup 1 > C < 1. What is the comma 

category {c [ c')7 



4.6.5 Arithmetic of categories 

In Section 2.7.3, we summarized some of the properties of products, coproducts, and 
exponentials for sets, attempting to show that they lined up precisely with familiar 
arithmetic properties of natural numbers. Astoundingly, we can do the same for cate- 
gories. 
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In the fohowing proposition, we denote the coproduct of two categories A and B by 
the notation A + B rather than Au B. We also denote the functor category Fun(^, B) 
by B^. Finally, we use and 1 to refer to the discrete category on and on 1 object, 
respectively. 

Proposition 4.6.5.1. The following isomorphisms exist for any small categories A^B^ 
and C. 

• ^ + ^ ^ 

• A+B^B+A 

• {A + B)+C^A+{B + C) 

• ^ X ^ 

• ^ X 1 ^ ^ 

• AxB^BxA 

• {AxB)xC^Ax{BxC) 

• Ax{B + C)^{AxB) + {AxC) 

• A^^l 

• A^^A 

• 0-^ ^0, ifA^O 
. l^^l 

• A^+^ ^A^ xA^ 

• {A^f ^ ^^^^ 

Proof. These are standard results; see [Mac]. 

□ 
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Chapter 5 

Categories at work 



We have now set up an understanding of the basic notions of category theory: categories, 
functors, natural transformations, and universal properties. We have discussed many 
sources of examples: orders, graphs, monoids, and databases. We begin this chapter 
with the notion of adjoint functors, which are like dictionaries that translate back and 
forth between different categories. 

5.1 Adjunctions 

In the introduction to this chapter I said that adjoint functors are like dictionaries that 
translate back and forth between different categories. In the common understanding 
of dictionaries, we assume that the two languages (say French and English) are equally 
expressive, and that a good dictionary will be an even exchange of ideas. But in category 
theory we often have two categories that are not on the same conceptual level. This is 
most clear in the case of so-called free- forgetful adjunctions. 

5.1.1 Discussion and definition 

Consider the category of monoids and the category of sets. A monoid (M, id, ★) is a set 
with an identity element and a multiplication formula that is associative. A set is just 
a set. A dictionary between Mon and Set should not have to set up an even exchange, 
but instead an exchange that is appropriate to the structures at hand. It will be in 
the form of two functors, one we'll denote by L : Set Mon, and one we'll denote by 
R: Mon Set. But to say what "appropriate" means requires more work. 

Let's bring it down to earth. A one-year-old can make repeatable noises and an 
adult can make repeatable noises. One might say "after all, talking is nothing but 
making repeatable noises." But the adult's repeatable noises are called words, they form 
sentences, and these sentences can cause nuclear wars. There is a difference. A tennis 
match can be viewed as physics, but you won't see the match. So we have two categories 
here: ((repeated noises)) and ((meaningful words)). We need adjoint functors going back 
and forth. 

To translate baby talk into adult language we would make every repeated noise a 
kind of word, thereby granting it meaning. We don't know what it means, but we give 
it a slot in our lexicon, always pondering "I wonder what she means by Ronnon.." To 
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translate from meaningful words to repeatable noises is easy. We just hear the word as 
a repeated noise, which is how the baby hears it. 

Adjoint functors often come in the form of "free" and "forgetful". Here we freely add 
Ronnon to our lexicon without having any idea how it adheres to the rest of the child's 
noises or feelings. But it doesn't act like a sound to us, it acts like a word; we don't know 
what it means but we figure it means something. Conversely, the translation going the 
other way is "forgetful", forgetting the meaning of our words and just hearing them as 
sounds. The baby hears our words and accepts them as mere sounds, not knowing that 
there is anything extra to get. 

Back to sets and monoids, the sets are like the babies from our story: they are 
simple objects full of unconnected dots. The monoids are like adults, forming words and 
performing actions. Each element means something and combines with other elements 
in some way. We have lots of different sets and lots of different monoids, just as for 
babies and adults, but there are patterns to the behavior of each kind and we put them 
in different categories. 

Applying free functor L: Set — > Mon to a set X makes every element x e X a 
word, but they combine freely into lists. We discussed the free functor in Section 3.1.1.9; 
there are no relations between the "words" in these lists. Applying the forgetful functor 
R: Mon Set is merely forgetting that the elements of the monoid are anything but 
mere elements of a set. It sends a monoid (M, 1, ★) to the set M. 

But this is all just ideas. Let's give a definition, then return to our sets, monoids, 
sounds, and words. 

Definition 5.1.1.1. Let A and B be categories. An adjunction between A and S is a 
pair of functors 

L:B^A and R: A ^ B 
together with a natural isomorphism for any objects A e Ob(^) and B e Ob(S): 

aB,A^omA{L{B),A) Hom^(5, 

This isomorphism is called the mate isomorphism for the (L, R) adjunction, and for any 
morphism /: L{B) A in A, we refer to aB,A{f) • B R{A) as the mate of /. ^ 

The functor L is called the left adjoint and the functor R is called the right adjoint. 
We may say that L is the left adjoint of R or that R is the right adjoint of L. ^ We 
often denote this setup by 

L: B - — ^ A :R 

Proposition 5.1.1.2. Let L: Set Mon be the functor sending X e Ob(Set) to the 
free monoid L{X) := (List(X), [ ], as in Definition 3.1.1.12. Let R: Mon Set 
be the functor sending each monoid M := (M, 1,*) to its underlying set R{M) := M. 
Then L is left adjoint to R. 

Proof. If we can find a natural isomorphism of sets 

ax,M' ^omMon{L{X),M) Roms^t {X, R{M)) 
we will have succeeded in showing that these functors are adjoint. 

-•^ Conversely, for any g: B ^ R{A) in B we refer to ct^^j^{g) : L{B) ^ A as the mate of g. 
^The left adjoint does not have to be called L, nor does the right adjoint have to be called of 
course. This is suggestive. 
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Suppose given an element / g HomMon(^(-^), i-e. a monoid homomorphism 
/: List(X) M (sending [ ] to 1 and list concatenation to ★). Then in particular we 
can apply / to the singleton list [x] for any x E X. This gives a function X ^ M hy 
X ^ /(M), and this is ax,M{f) X ^ M = R{M.). We need only to supply an inverse 
pXM' Homset(X,i?(>/)) HomMon(i^(X), Al). 

Suppose given an element g e Homset(-^7 i-e. a function g: X ^ M. Then to 

any list I = [xi, ^2, . . . , Xn\ e List(X) we can assign I3x,m{^) * ^(^2) * • • • * g{xn) 

(if ^ = [ ] is the empty list, assign /3x,m([ ]) !)• We now have a function List(X) M. 
It is a monoid homomorphism because it respects identity and composition. It is easy 
to check that {3 and a are mutually inverse, completing the proof. 

□ 

Exercise 5.1.1.3. Let X = {a, 6, c} and let M = (M, 1, *) be the multiplicative monoid 
of natural numbers (see Example 3.1.3.2). Let f:X M be the function given by 
/(a) = 7J{b) = 2, /(c) = 2, and let : Homset(X, i?(A^)) - HomMon(i^(X), A^) 

be as in the proof of Proposition 5.1.1.2. What is (3x,M{f){[b^b,a^c])? 

Let us look once more at the adjunction between adults and babies. Using the nota- 
tion of Definition 5.1.1.1 A is the "category" of meaningful words and B is the "category" 
of repeated noises. The left adjoint turns every repeated sound into a meaningful word 
and the right adjoint forgets the meaning of any word and considers it merely as a sound. 

At the risk of taking this simple analogy too far, let's have a go at the heart of the 
issue: how to conceive of the isomorphism of Hom's. Once we have freely given a slot to 
each of baby's repeated sounds, we try to find a mapping from the lexicon L{B) of these 
new words to our own lexicon A of meaningful words; these are mappings in the adult 
category A of the form L{B) A. And the baby tries to find a mapping (which we might 
see as emulation) from the set B of repeatable sounds to the set R{A) of the sounds the 
adult seem to repeat. If there was a global system for making these transformations that 
would be an adjunction. 

Note that it makes little sense in this example to look for a mapping in the category of 
meaningful words from our lexicon to the wordifications of baby-sounds; there is unlikely 
to be a good candidate for most of our words. The next example shows that the "leftness" 
and "rightness" are not arbitrary. 

Example 5.1.1.4. Let L: Set Mon and R: Mon Set be the free and forgetful 
functors from Proposition 5.1.1.2. We know that L is left adjoint to R; however L is not 
right adjoint to R. In other words, we can show that the necessary natural isomorphism 
cannot exist. 

Let X = {a, 6} and let M = ({1},1,!) be the trivial monoid. Then the nec- 
essary natural isomorphism would need to give us a bijection Hom-^oni-M^ L{X)) = 
Homset({l}, -^)- But the left-hand side has one element, because Ai is the initial object 
in Mon (see Example 4.5.3.7), whereas the right-hand side has two elements. Therefore 
no isomorphism can exist. 

Example 5.1.1.5. Preorders have underlying sets, giving rise to a functor U : PrO Set. 
The functor U has both a left adjoint and a right adjoint. The left adjoint of U is 
D: Set PrO, sending a set X to the discrete preorder on X (the preorder with 
underlying set X, having the fewest possible ^'s). The right adjoint of U is /: Set 
PrO, sending a set X to the indiscrete preorder on X (the preorder with underlying set 
X, having the most possible ^'s). See Example 3.4.3.5. 
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Exercise 5.1.1.6. Let U : Grph Set denote the functor sending a graph to its under- 
lying set of vertices. This functor has both a left and a right adjoint. What are they? 



Example 5.1.1.7. Here are some other adjunctions: 

• Ob: Cat Set has a left adjoint Set Cat given by the discrete category. 

• Ob: Cat Set has a right adjoint Set Cat given by the indiscrete category. 

• The underlying graph functor Cat Grph has a left adjoint Grph Cat given 
by the free category. 

• The functor PrO Grph given by drawing edges for ^'s has a left adjoint given 
by existence of paths. 

• The forgetful functor from posets to preorders has a left adjoint given by quotient 
by isomorphism relation. 

• Given a set A, the functor — x A: Set Set has a right adjoint }iom{A^ — ) (this 
is currying). 

Exercise 5.1.1.8. Let F \ C ^ V and G:V^Che mutually inverse equivalences of 
categories (see Definition 4.3.4.1). Are they adjoint in one direction or the other? 

Exercise 5.1.1.9. The discrete category functor Disc: Set Cat has a left adjoint 
p: Cat Set. Describe it. Hint: look at the mate isomorphism at the component 
corresponding to [1] G Ob(Cat) and 2 g Ob(Set) 

5.1.1.10 Quantifiers as adjoints 

One of the simplest but neatest places that adjoints show up is between preimages and 
the logical quantifiers 3 and V. The setting in which to discuss this is that of sets and 
their power preorders. That is, if X is a set then recall from Section 3.4.2 that the power 
set P(X) has a natural ordering by inclusion of subsets. 

Given a function f : X ^ Y and a subset V ^ Y the preimage is f~^{V) := {x g 
X I f{x) G V}. IfV <^V then f-^{V') ^ f~^{V), so in fact /-^ : ¥{Y) P(X) can be 
considered a functor (where of course we are thinking of preorders as categories). The 
quantifiers appear as adjoints of f~^. 

Let's begin with the left adjoint of f~^. It is a functor L: P(X) ¥(Y). Choose an 
object /7 ^ X in P(X). It turns out that 

L{U) = {yeY\3xe f-\y) such that x g U}. 

And the right adjoint R: P(X) P(y), when applied to U is 

R{U) = {yeY\yxef-\y),xeU}. 

5.1.2 Universal concepts in terms of adjoints 

In this section we discuss how universal concepts, i.e. initial objects and terminal objects, 
colimits and limits, are easily phrased in the language of adjoint functors. 
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Example 5.1.2.1. Let C be a category and t: C 1 the unique functor to the terminal 
category. Then t is a left adjoint if and only if C has a terminal object, and t is a right 
adjoint if and only if C has an initial object. The proofs are dual, so let's focus on the 
first. 

The functor t has a right adjoint R: 1 ^ C if and only if there is an isomorphism 

Home (c,r) = Homi(t(c), 1), 

where r = R{1). But Homi(t(c), 1) has one element. Thus t has a right adjoint iff there 
is a unique morphism c ^ r in C. This is the definition of r being a terminal object. 

When we defined colimits and limits in Definitions 4.5.3.25 and 4.5.3.18 we did so for 
individual /-shaped diagrams X : I ^ C. Using adjoints we can define the limit of every 
/-shaped diagram in C at once. 

Let t: C ^ 1 denote the unique functor to the terminal category. Given an object 
c E Ob(C), consider it as a functor c: 1 ^ C. Then cot: I ^ C is the constant functor 
at c, sending each object in / to the same C-object c, and every morphism in / to idc. 
This induces a functor that we denote by : C ^ Fun(/,C). 

Suppose we want to take the colimit or limit of X. We are given an object X of 
Fun(/,C) and we want back an object of C. We could hope, and it turns out to be true, 
that the adjoints of are the limit and colimit. Indeed let : Fun(/,C) ^ C be the 
left adjoint of A^, and let : Fun(/,C) ^ C be the right adjoint of A^. Then is the 
functor that takes colimits, and Ht is the functor that takes limits. 

We will work with a generalization of colimits and limits in Section 5.1.4. But for 
now, let's bring this down to earth with a concrete example. 

Example 5.1.2.2. Let C = Set, and let / = 3. The category Fun(/, Set) is the category 
of {1, 2, 3}-indexed sets, e.g. (Z, M, Z) g Ob(Fun(/, Set)) is an object of it. The functor 
At'. Set Fun(/, Set) acts as follows. Given a set c g Ob(Set), consider it as a 
functor c: 1 ^ Set, and the composite cot: I ^ Set is the constant functor. That is, 
At{c) : I Set is the functor sending 1 ^ c, 2 ^ c, and 3 ^ c. 

To say that At has a right adjoint called Ht : Fun(/, Set) Set and that it "takes 
limits" should mean that if we look through the definition of right adjoint, we will see 
that the formula will somehow yield the appropriate limit. Fix a functor D: I ^ Set, so 
L>(1),D(2), and D{3) are sets. The limit limD of D is the product D{1) x D{2) x D{3). 
How does this fact arise in the definition of adjoint? 

The definition of H^ being the right adjoint to A^ says that there is a natural isomor- 
phism of sets, 

Homp^,(j,Set)(A(c),/)) ^ Homset(c,H,(/^)). (5.1) 

The left-hand side has elements / g Hompun(/,Set)(A(c), D) that look like the left below, 
but having these three maps is equivalent to having the diagram to the right below: 
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The isomorphism in (5.1) says that choosing the three maps /(I), /(2), /(3) is the same 
thing as choosing a function c 11^(1^). But this is very close to the universal property 
of limits: there is a unique map £: c ^ D{1) x D{2) x D{3)^ so this product serves well 
as as we have said. We're not giving a formal proof here, but what is missing at 
this point is the fact that certain diagrams have to commute. This comes down to the 
naturality of the isomorphism (5.1). The map i induces a naturality square 



D 



D 



which says that the following diagram commutes: 




D{1) X D{2) X D{3) 
5.1.3 Preservation of colimits or limits 

One useful fact about adjunctions is that left adjoints preserve all colimits and right 
adjoint s preserve all limits. 

Proposition 5.1.3.1. Let L: B < ^ A :R be an adjunction. For any indexing cate- 
gory I and functor D: I ^ B, if D has a colimit in B then there is a unique isomorphism 

L(colimD) = colim(L o D). 

Similarly, for any I e Ob(Cat) and functor D\ I ^ A, if D has a limit in A then 
there is a unique isomorphism 

R(\\mD) ^ \\m{RoD). 

Proof. The proof is simple if one knows the Yoneda lemma. I have decided to skip that 
to keep the book shorter. See []. 

□ 

Example 5.1.3.2. Since Ob: Cat Set is both a left adjoint and a right adjoint, it must 
preserve both limits and colimits. This means that if you want to know the set of objects 
in the fiber product of some categories, you can simply take the fiber product of the set 
of objects in those categories, 

Ob(^ xc B) ^ Oh{A) xob(c) Ob(S). 
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While the right-hand side might look daunting, it is just a fiber product in Set which is 
quite understandable. 

This is greatly simplifying. If one thinks through what defines a limit in Cat, one 
is dragged through notions of slice categories and terminal objects in them. These slice 
categories are in Cat so they involve several categories and functors, and it gets hairy 
or even hopeless to a beginner. Knowing that the objects are given by a simple fiber 
product makes the search for limits in Cat much simpler. 

For example, if [n] is the linear order category of length n then [n] x [m] has nm + 
n + m + 1 objects because [n] has n + 1 objects and [m] has m + 1 objects. 

Example 5.1.3.3. The "path poset" functor L: Grph PrO given by existence of paths 
(see Exercise 4.1.2.9) is left adjoint to the functor R: PrO Grph given by replacing 
^'s by arrows. This means that L preserves colimits. So taking the union of graphs G 
and H results in a graph whose path poset L{G u H) is the union of the path posets of 
G and H. But this is not so for products. 



Let G = 



/ b 
^ • 



. Then L{G) = L{H) = [1], the linear order of length 1. 



But the product G x H in Grph looks like the graph 




Its preorder L{G x H) does not have (a, a) ^ (a,^), whereas this is the case in L{G) x 
LiH). 

Exercise 5.1.3.4. The distributive law for addition of natural numbers says {A-\-B)xC = 
A X C -\- B X C. Show that this follows from the currying adjunction on Set. 



5.1.4 Data migration 

As we saw in Sections 4.2.2 and 4.2.2.5, a database schema is a category C and an instance 
is a functor I : C ^ Set. 

Notation 5.1.4.1. Let C be a category. Throughout this section we denote by C-Set 
the category Fun(C, Set) of functors from C to Set, i.e. the category of instances on C. 

In this section we discuss what happens to the resulting instances when different 
schemas are connected by a functor, say F: C ^ V. It turns out that three adjoint func- 
tors emerge: Ap: V-Set C-Set, S^: C-Set P-Set, and Up : C-Set P-Set, 
where A^? is adjoint to both, 

Sf: C-Set ^ — ^ V-Set :Af Af: VSet ^ — t C-Set :Uf. 

It turns out that almost all the basic database operations are captured by these three 
functors. For example, Af handles the job of duplicating or deleting tables, as well as 
duplicating or deleting columns in a single table. The functor H^? handles taking unions, 
and the functor 11^ handles joining tables together, matching columns, or selecting the 
rows with certain properties (e.g. everyone whose first name is Mary). 
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5.1.4.2 Fullback: A 

Given a functor F: C ^ V and a functor I : V —> Set, we can compose them to get a 
functor I o F: C ^ Set. In other words, the presence of F provides a way to convert V- 
instances into C-instances. In fact this conversion is functorial, meaning that morphisms 
of P-instances are sent to morphisms of C-instances. We denote the resulting functor by 
Ap: V-Set C-Set and call it puUback along F. 

We have seen an example of this before in Example 4.3.2.15, where we showed how a 
monoid homomorphism F: A4' —> A4 could add functionality to a finite state machine. 
More generally, we can use pullbacks to reorganize data, copying and deleting tables and 
columns. 

Remark 5.1.4.3. Given a functor F: C ^ which we think of as a schema translation, 
the functor Ap' V-Set C-Set "goes the opposite way", which is often very strange 
to people at first. The rough idea is that if I get my information from you, then your 
information becomes my information. 

Consider the following functor F : C ^ V: ^ 



C :-- 




Salary- 




Salary 



= : V 



(5.2) 



Let's spend a moment recalling how to "read" schemas. In schema C there are leaf 
tables SSN, First, Last, Salary, which represent different kinds of basic data. More 
interestingly, there are two fact tables. The first is called Tl and it relates SSN, First, 
and Last. The second is called T2 and it relates First, Last, and Salary. 

The functor F: C ^ V relates C to a schema with a single fact table relating SSN, 
First, Last, and Salary. We are interested in A^?: V-Set C-Set. Suppose given 
the following database instance F. V ^ Set on V: 



T 


ID 


SSN 


First 


Last 


Salary 


XF667 


115-234 


Bob 


Smith 


$250 


XF891 


122-988 


Sue 


Smith 


$300 


XF221 


198-877 


Alice 


Jones 


$100 



^This example was taken from [Spl]. 
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SSN 
ID 

115-234 
118-334 



122-988 



198-877 



342-164 



First 
ID 

Adam 
Alice 



Bob 



Carl 



Sam 



Sue 



Last 




Salary 


ID 




ID 


Jones 




$100 


Miller 




$150 


Pratt 




$200 


Richards 




$250 


Smith 




$300 



How do you get the instance Af{I): C Set? The formula was given above: 
compose with F. In terms of tables, it feels like duplicating table T as Tl and T2, but 
deleting a column from each in accordance with the definition of C. Here is the result in 
table form: 



Tl 


ID 


SSN 


First 


Last 


XF667 


115-234 


Bob 


Smith 


XF891 


122-988 


Sue 


Smith 


XF221 


198-877 


Alice 


Jones 



T2 


ID 


First 


Last 


Salary 


XF221 


Alice 


Jones 


$100 


XF667 


Bob 


Smith 


$250 


XF891 


Sue 


Smith 


$300 



SSN 



ID 



115-234 



118-334 



122-988 



198-877 



342-164 



First 



ID 



Adam 



Alice 



Bob 



Carl 



Sam 



Sue 



Last 




Salary 


ID 




ID 


Jones 




$100 


Miller 




$150 


Pratt 




$200 


Richards 




$250 


Smith 




$300 



Exercise 5.1.4.4. Let C = (G, ^) be a schema. A leaf table is an object c g Ob(C) with 
no outgoing arrows. 

a. ) Write this condition mathematically in three different languages: that of graphs 

(using symbols src^tgt)^ that of categories (using Hom^, etc.), and that of 
tables (in terms of columns, tables, rows, etc.). 

b. ) In the language of categories, is there a difference between a terminal object and a 

leaf table? 



Exercise 5.1.4.5. Consider the schemas 

and 



[1] 



/ 



[2] 



g 


1 h 


» to 


• 







and the functor F: [1] ^ [2] given by sending 0^0 and 1^2. 
a.) How many possibilities are there for F{f)7 
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b.) Now suppose /: [2] ^ Set is given by the following tables. 










ID 


g 




1 


Am 


To be verb 




ID 


h 


Baltimore 


Place 




Action verb 


Verb 


Carla 


Person 




Adjective 


Adjective 


Develop 


Action verb 




Place 


Noun 


Edward 


Person 




Person 


Noun 


Foolish 


Adjective 




To be verb 


Verb 


Green 


Adjective 







ID 



Adjective 



Noun 



Verb 



Write out the two tables associated to the [l]-instance Af{I)' [1] Set. 



5.1.4.6 Left pushforward: S 

Let F: C P be a functor. The functor Ap- V-Set C-Set has a left adjoint, 
T^F ' C-Set V-Set. The rough idea is that T^f performs parameterized colimits. 
Given an instance I : C ^ Set, we get an instance on V that acts as follows. For each 
object d E Ob(P), the set Ei?(/)((i) is the colimit (think, union) of some diagram back 
home in C. 

The precise description can be found in [Spl]; here we just repeat some examples 
from that paper. 

Example 5.1.4.7. We again use the functor F: C ^ V drawn below 



C :-- 




(5.2) 



We will be applying the left pushforward S f ' C-Set 
/: C ^ Set: 



V-Set to the following instance 



Tl 


ID 


SSN 


First 


Last 


Tl-OOl 


115-234 


Bob 


Smith 


Tl-002 


122-988 


Sue 


Smith 


Tl-003 


198-877 


Alice 


Jones 



T2 


ID 


First 


Last 


Salary 


T2-001 


Alice 


Jones 


$100 


T2-002 


Sam 


Miller 


$150 


T2-004 


Sue 


Smith 


$300 


T2-010 


Carl 


Pratt 


$200 
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SSN 
ID 

115-234 
118-334 



122-988 



198-877 



342-164 



First 
ID 

Adam 
Alice 



Bob 



Carl 



Sam 



Sue 



Last 




Salary 


ID 




ID 


Jones 




$100 


Miller 




$150 


Pratt 




$200 


Richards 




$250 


Smith 




$300 



The functor F: C ^ V sent both tables Tl and T2 to table T. Applying T^p will take 
the what was in Tl and T2 and put the union in T. The result T^fI: V Set is as 
follows: 



T 


ID 


SSN 


First 


Last 


Salary 


Tl-OOl 


115-234 


Bob 


Smith 


Tl-OOl. Salary 


Tl-002 


122-988 


Sue 


Smith 


Tl-002. Salary 


Tl-003 


198-877 


Alice 


Jones 


Tl-003.Salary 


T2-001 


T2-A101.SSN 


Alice 


Jones 


$100 


T2-002 


T2-A102.SSN 


Sam 


Miller 


$150 


T2-004 


T2-004.SSN 


Sue 


Smith 


$300 


T2-010 


T2-A110.SSN 


Carl 


Pratt 


$200 



SSN 

ID 

115-234 



118-334 



122-988 



198-877 



342-164 



T2-001.SSN 



T2-002.SSN 



T2-004.SSN 



T2-010.SSN 



First 



ID 



Adam 



Alice 



Bob 



Carl 



Sam 



Sue 



Last 



ID 



Jones 



Miller 



Pratt 



Richards 



Smith 



Salary 



ID 



$100 



$150 



$200 



$250 



$300 



Tl-OOl. Salary 



Tl-002-Salary 



Tl-003-Salary 



As you can see, there was no set salary information for any data coming from table 
Tl nor any set SSN information for any data coming form table T2. But the definition 
of adjoint, given in Definition 5.1.1.1, yielded the universal response: freely add new 
variables that take the place of missing information. It turns out that this idea already 
has a name in logic, Skolem variables^ and a name in database theory, labeled nulls.. 

Exercise 5.1.4.8. Consider the functor F: 3^ 2 sending 1 ^ 1, 2 2, 3 ^ 2. 



a. ) Write down an instance I: C ^ Set. 

b. ) Given the description that "Si? performs a parameterized colimit", make an educated 

guess about what will be. Give your answer in the form of two sets that are 

made up from the three sets you already wrote down. 
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5.1.4.9 Right pushforward: 11 

Let F : C ^ V he Si functor. We heard in Section 5.1.4.6 that the functor Ap : V-Set 
C-Set has a left adjoint. Here we explain that it has a right adjoint, Hf ' C-Set V-Set 
as well. The rough idea is that Hp performs parameterized limits. Given an instance 
I: C ^ Set, we get an instance on V that acts as follows. For each object d e Ob(P), 
the set ni?(/)((i) is the limit (think, fiber product) of some diagram back home in C. 

The precise description can be found in [Spl]; here we just repeat some examples 
from that paper. 

Example 5.1.4.10. We once again use the functor F: C ^ V from Example 5.1.4.7. We 
will apply the right pushforward Up to instance / : C ^ Set from that example. ^ 

The instance IIf{I) will put data in all 5 tables in V. In T it will put pairs (^1,^2) 
where ti is a row in Tl and ^2 is a row in T2 for which the first and last names agree. 
It will copy the leaf tables exactly, so we do not display them here; the following is the 
table T for Up {I): 



T 


ID 


SSN 


First 


Last 


Salary 


T1-002T2-A104 


122-988 


Sue 


Smith 


$300 


T1-003T2-A101 


198-877 


Alice 


Jones 


$100 



Looking at Tl and T2, there were only two ways to match first and last names. 



To repeat for convenience, 




I: C ^ Set 



Tl 


ID 


SSN 


First 


Last 


Tl-OOl 


115-234 


Bob 


Smith 


Tl-002 


122-988 


Sue 


Smith 


Tl-003 


198-877 


Alice 


Jones 



(5.2) 



T2 


ID 


First 


Last 


Salary 


T2-001 


Ahce 


Jones 


$100 


T2-002 


Sam 


Miller 


$150 


T2-004 


Sue 


Smith 


$300 


T2-010 


Carl 


Pratt 


$200 



SSN 



ID 



115-234 



118-334 



122-988 



198-877 



342-164 



First 



ID 



Adam 



Alice 



Bob 



Carl 



Sam 
"S^ 



Last 




Salary 


ID 




ID 


Jones 




$100 


Miller 




$150 


Pratt 




$200 


Richards 




$250 


Smith 




$300 
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Exercise 5.1.4.11. Consider the functor F: 3^ 2 sending l^l,2^^2,3^-^2. 

a. ) Write down an instance I: C ^ Set. 

b. ) Given the description that "11^? performs a parameterized limit", make an educated 

guess about what IIf{I) wih be. Give your answer in the form of two sets that are 
made up from the three sets you already wrote down. 



5.2 Categories of functors 

For any two categories C and P, ^ we discussed the category Fun(C,P) of functors and 
natural transformations between them. In this section we discuss functor categories a bit 
more and give some important applications within mathematics (sheaves) that extend 
to the real world. 

5.2.1 Set- valued functors 

Let C be a category. Then we have been writing C-Set to denote the functor category 
Fun(C, Set). Here is a nice result about these categories. 

Proposition 5.2.1.1. Let C be a category. The category C -Set is closed under colimits 
and limits. 

Sketch of proof. Let J be an indexing category and D: J ^ C-Set a functor. For each 
object c G Ob(C), we have a functor Dc'. J ^ Set defined by Dc{j) = D{j){c). Define 
a functor L: C ^ Set by L(c) = \im j and note that for each f : c ^ c' in C there 
is an induced function L{f): L{c) L{c'). One can check using that L is a limit of J, 
because it satisfies the relevant universal property. 
The dual proof holds for colimits. 

□ 

Application 5.2.1.2. When taking in data about a scientific subject, one often finds 
that the way one thinks about the problem changes over time. We understand this 
phenomenon in the language of databases in terms of a series of schemas Ci, C2, . . . , C^, 
perhaps indexed chronologically. The problem is that old data is held in old schemas 
and we want to see it in our current understanding. The first step is to transfer all the 
old data to our new schema in the freest possible way, that is, making no assumptions 
about how to fill in the new fields. If one creates functors Fii Ci C^+i from each of 
these schemas to the next, then we can push the data forward using S^?. . 

Doing this we will have n datasets on V := Cn^ namely one for each "epoch of 
understanding". Since the category V-Set has all colimits, we can take the union of 
these datasets and get one. It will have many Skolem variables (see Example 5.1.4.7), 
and these need to be handled in a coherent way. However, the universality of left adjoints 
could be interpreted as saying that any reasonable formula for handling this old data 
can be applied to our results. 

00 



^Technically C has to be small (see Remark 4.1.1.2), but as we said there, we are not worrying about 
that distinction in this book. 
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Proposition 5.2.1.1 says that we can add or multiply database states together. In 
fact, database states on C form what is called a topos which means that just about every 
consideration we made for sets holds for instances on any schema. Perhaps the simplest 
schema is C = on which the relevant topos is indeed Set. But schemas can be 
arbitrarily complex, and it is impressive that all of these considerations make sense in 
such generality. Here is a table that makes a comparison between these domains. 



Dictionary between Set and C-Set 


Concept in Set 


Concept in C— Set 


Set 


Object in C-Set 


Function 


Morphism in C-Set 


Element 


Representable functor 


Empty set 


Initial object 


Natural numbers 


Natural numbers object 


Image 


Image 


(Co)limits 


(Co)limits 


Exponential objects 


Exponential objects 


"Familiar" arithmetic 


"Familiar" arithmetic 


Power sets 2^ 


Power objects 


Characteristic functions 


Characteristic functions 


Surjections, injections 


Epimorphisms, monomorphisms 



In the above table we said that elements of a set are akin to representable functors, 
but we have not yet defined those; we do so now. 

5.2.1.3 Representable functors 

Definition 5.2.1.4. Let C be a category and let c e Ob(C) be an object. The functor 
Homc(c, — ): C Set, sending d e Ob(C) to the set Hom^ (c,(i) and acting similarly 
on morphisms d d' ^ is said to be represented by c. If a functor F: C ^ Set is 
isomorphic to IIomc(c, — ), we say that F is a representable functor. We sometimes write 
Y{c) := Homc(c, — ) for short. 

Example 5.2.1.5. Given a category C and an object c e Ob(C), we get a representable 
functor. If we think of C as a database schema and c as a table, then what does the 
representable functor look like in databases? It turns out that the following procedure 
will generate it. 

Begin by writing a new row, say "©", in the ID column of table c. For each foreign 
key column /: c ^ c\ add a row in the ID column of table c' called "/(©)" and record 
that result (i.e. "/(®)") the / column of table c. Repeat as follows: for each table 
identify all rows r that have blank cell in column g: d ^ e. Add a new row called ''g{ry^ 
to table e and record that result in the (r, g) cell of table d. 

Here is a concrete example. Let C be the following schema: 

A f B 91 ^ c i D 
• ^ • ^ • ^ • 

92 



Then yB : C Set is the following instance 
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A 


ID 


/ 



B 


ID 


9i 


92 


h 


© 


5i(©) 


52(©) 


h{©) 



C 


ID 


i 


5i(©) 


i{9i{©)) 


92{©) 


i{92(©)) 



D 



ID 



E 



ID 



h{©) 



We began with a single element in table B and followed the arrows, putting new en- 
tries wherever they were required. One might call this the schematically implied reference 
spread or SIRS of the element © in table B. 

Representable functors Y{c) yield databases states that are as free as possible, subject 
to having the initial row © in table c. We have seen such things this before (by the name 
of Skolem variables) when studying the left pushforward S. Indeed, if c g Ob(C) is an 
object, we can consider it as a functor c: 1 ^ C. A database instance on 1 is the same 
thing as a set X. The left pushforward 5]c(X) has the same kinds of Skolem variables. 
If X = {©} is a one element set, then we get the representable functor Sc({©}) = ^(c). 



5.2.1.6 Yoneda's lemma 

One of the most powerful tools in category theory is Yoneda's lemma. It is often consid- 
ered by new students to be quite abstract, but grounding it in databases may help. 

The idea is this. Suppose that I: C ^ Set is a database instance, and let c e Ob(C) 
be an object. Because / is a functor, we know that for every row r g /(c) in table c a 
value has been recorded in the /-column, where f : c ^ c' is any outgoing arrow. The 
value in the (r, /)-cell refers to some row in table c' . What we're saying is that each row 
in table c induces SIRS throughout the database. They may not be "Skolem", or in any 
sense "freely generated", but they are there nonetheless. The point is that to each row 
in c there is a unique mapping Y(c) I. 

Lemma 5.2.1.7 (Yoneda's lemma). Let C he a category, c g Ob(C) an object, and 
I: C ^ Set an instance. There is a natural bisection 

Homc-Set(r(c),/)^-/(c). 

Proof. See [Mac]. 

□ 



5.2.2 Database instances in other categories 
5.2.2.1 Representations of groups 

The classical mathematical subject of representation theory is the study of Fun(G, Vect) 
where G is a group and Vect is the category of vector spaces (over say R).. Every such 
functor F: G ^ Vect is called a representation of G. Since G is a category with one 
object A, F consists of a single vector space V = F{k) together with an action of G on 
it. 

We can think of this in terms of databases if we have a presentation of G in terms of 
generators and relations. The schema corresponding to G has one table and this table 
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has a column for each generator. Giving a representation F is the same as giving an 
instance on our schema, with some properties that stem from the fact that our target 
category is Vect rather than Set. There are many possibilities for expressing ^ such 
data. 

One possibility is if we could somehow draw say if V is 1-, 2-, or 3-dimensional. 
If so, let P be our chosen picture of F, e.g. P is the standard drawing of a Cartesian 
coordinate plane. Then every column of our table would consist entirely of the picture 
P instead of a set of rows. Drawing a point in the ID-column picture would result in 
a point being drawn in each other column's picture, in accordance with the G-action. 
Each column would of course respect addition and scalar multiplication. 

Another possibility is to use the fact that there is a functor U : Vect Set, so our 
instance F : G ^ Vect can be converted to an ordinary instance U o F: G ^ Set. We 
would have an ordinary set of rows. This set would generally be infinite, but it would 
be structured by addition and scalar multiplication. For example, assuming V is finite 
dimensional, one could find a few rows that generated the rest. 

A third possibility is to use monads, which allow the table to have only as many rows 
as V has dimensions. This is a considerable savings of space. See Section 5.3. 

5.2.2.2 Representations of quivers 

Representation theory also studies representations of quivers. A quiver is just the free 
category (see Example 4.1.2.27) on a graph. If P is a graph with free category V then a 
representation of the quiver P is a functor F: V ^ Set. Such a representation consists 
of a vector space at every vertex of P and a linear transformation for every arrow. All of 
the discussion from Section 5.2.2.1 works in this setting, except that there is more than 
one table. 

5.2.2.3 Other target categories 

One can imagine the value of using target categories other than Set or Vect for databases. 

Application 5.2.2 A. Geographic data consists of maps of the earth together with various 
functions on it. For example for any point on the earth one may want to know the 
average temperature recorded in the past 10 years, or the precise temperature at this 
moment. Earth can be considered as a topological space, E. Similarly, temperatures 
on earth reside on a continuum, say the space T of real numbers [—100, 200]. Thus the 
temperature record is a function E ^ T. 

Other records such as precipitation, population density, elevation, etc. can all be 
considered as continuous functions from E to some space. Agencies like the US Geological 
Survey hold databases of such information. By modeling them on functors C Top, 
they may be able to employ mathematical tools such as persistent homology [WeS] to 
find interesting invariants of the data. 

00 

Application 5.2.2.5. Many other scientific disciplines could use the same kind of tool. For 
example, in studying the mechanics of materials, one may want to consider the material 
as a topological space M and measure values such as energy as a continuous M ^ E. 
Such observations could be modeled by databases with target category Top or Vect 
rather than Set. 

^We would use the term "representing" or "presenting", but they are both taken in the context of 
our narrative! 
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00 

5.2.3 Sheaves 

Let X be a topological space (see Example 4.2.3.1), such as a sphere. In Section 5.2.2.3 
we discussed continuous functions out of X, and their use in science (e.g. recording 
temperatures on the earth as a continuous map X [—100,200]). Sheaves allow us to 
consider the local-global nature of such maps, taking into account reparable discrepancies 
in data gathering tools. 

Application 5.2.3.1. Suppose that X is the topological space corresponding to the earth; 
by a region we mean an open subset U X. Suppose that we cover X with 10,000 
regions /727 • • • , ^loooo^ such that there are overlaps between some of these regions 
(e.g. perhaps n Ug ^ 0). For each i,j let Uij = Ui n Uj. 

For each region Ui ^ X we have a temperature recording device, which gives a 
function Ti \ Ui ^ [—100, 200]. If Ui nUj ^ then two different recording devices give 
us temperature data for the intersection Ui^j. Suppose we find that they do not give 
precisely the same data, but that there is a translation formula between their results. 
For example, Ti might register 3° warmer than Tj registers in that region. 

A consistent system of translation formulas is called a sheaf. It does not demand a 
universal "true" temperature function, but only a consistent translation system between 
them. 

00 

The following definitions makes the notion of sheaf precise, but we must go slowly 
(because it will already feel quick to the novice). For every region [/, we can record 
the value of some function (say temperature) throughout U ; although this record might 
consist of a mountain of data (a temperature for each point in [/!), we think of it as one 
thing. That is, it is one element in the set of value-assignments throughout U . A sheaf 
holds the set of possible values-assignments-throughout-[/'s for all the different /7's, as 
well as how a value-assignment throughout U restricts to a value-assignment throughout 
V for any subset V ^ U. 

Definition 5.2.3.2. Let X be a topological space, let Op(X) denote its partial order 
of open sets, and let Op(X)°P be the opposite category. A presheaf on X is sl functor 
O: Op(X)°P Set. For every open set U ^ X we refer to the set 0{U) as the set of 
values- assignments throughout U of O. If F ^ is an open subset, it corresponds to 
an arrow in Op(X) and applying the functor O yields a function called the restriction 
map from U to V and denoted pv,u ' ^{U) 0{V). Given a g 0{U)^ we may denote 
/)(y, U){a) by a|y; it is called the restriction of a to V . 

The category of presheaves on X is simply Op(X)°P-Set; see Definition 4.3.3.1. 

Exercise 5.2.3.3. a.) Come up with 4 overlapping open subsets that cover the square 
X := [0,3] X [0,3] ^ R^. Write down a label for each open set as well as a label 
for each overlap (2-fold, 3-fold, etc.); you now have labeled n open sets. For each of 
these open sets, draw a dot with the appropriate label, and then draw an arrow from 
one dot to another when the first refers to an open subset of the second. This is a 
preorder; call it Op(X). Now make up and write down formulas Ri: X ^ R and 
R2: X ^ R with Ri ^ , expressing a range of temperatures Ri (p) ^ x ^ R2 {p) 
that an imaginary experiment shows can exist at each point p in the square. 
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b. ) Suppose we now tried to make our presheaf O: Op(X)°P Set as follows. For each 

of your open sets, say A, we could put 

0{A) :={f:A^R \ i?i(a) ^ /(a) ^ i?2(a)}. 

What are the restriction maps? Do you like the name "value-assignment throughout 
A" for elements of (9(A)? 

c. ) We can now make another presheaf O' given the same experiment. For each of your 

open sets, say A, we could put 

0\A) := {/: A ^ R I / is continuous, and Ri{a) ^ f{a) ^ R2{a)}. 

Are you comfortable with the idea that there is a morphism of presheaves O' Ol 



Before we define sheaves, we need to clarify the notion of covering. If is a region 
and Vi, . . . ^Vn are subregions (i.e. for each 1 ^ z ^ n we have Vi ^ U. Then we say that 
the Vi cover U if every point in U is in Vi for some i. Another way to say this is that 
the natural function UiVi U is surjective. ^ 

Example 5.2.3.4. Let X = R be the space of real numbers, and define the following open 
subsets: U = (5, 10), Vi = (5, 7),V2 = (6, 9), = (7, 10). Then Vi, I/2, ^3 is a cover of U. 
It has overlaps V12 = VinV2 = (6, 7), V13 = Vi n I/3 = 0, V23 = V2nV3 = (7, 9). 

Given a presheaf O: Op(X)°P Set, we have sets and functions as in the following 
(incomplete) diagram 




0(U) ^ 0{V2) 




0{Vs) 

A presheaf O on X tells us what value-assignments throughout U can exist for each 
U. Suppose we have a value-assignment a e 0{U) throughout U and another value- 
assignment a' G 0{U') throughout U' ^ and suppose that they agree as value- assignments 
throughout U n U' ^ i.e. a\ur^u' = ct^lunU'- In this case we should have a unique value- 
assignment b E OuyjU' throughout U vjU' that agrees on the /7-part with a and agrees 
on the [/'-part with a'] i.e. h\u = a and h\u' = o! - This is the sheaf condition. 

Definition 5.2.3.5. Let X be a topological space, let Op(X) be its partial order of 
open sets, and let 0\ Op(X)°P Set be a presheaf. Given an open set /7 ^ X and a 
cover Vi, . . . , of /7, the following condition is called the sheaf condition for that cover. 

^To be pedantic, we should say that UiVi U is an epimorphism, which is the same thing as saying 
that on underlying points (i.e. via the "underlying points" functor Top Set) the map is a surjection. 
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Sheaf condition Given a sequence ai,...,an where each is a value- assignment e 
0{Vi) throughout Vi, and suppose that for ah i,j we have a^ly-ny^ = ^jWinVj] 
then there is a unique value-assignment b e Ou such that b\Vi = ai. 

The presheaf O is called a sheaf if it satisfies the sheaf condition for every cover. 

Example 5.2.3.6. Let XR and let [/, Vi, 1^2, Vs be the open cover given in Example 5.2.3.4. 
Given a measurement taken throughout Vi, a measurement taken throughout and 
a measurement taken throughout F3, we have elements ai e 0(Vi),a2 e 0(^2), and 
as E ©(Va). If they are in agreement on the overlap intervals, we can glue them to give 
a measurement throughout U. 

Application 5.2.3.7. Consider outer space as a topological space X. Different astronomers 
record observations. Let C = [390, 700] denote the set of wavelengths in the visible light 
spectrum (written in nanometers). Given an open subset U ^ X let 0{U) denote the 
set of functions U ^ C. The presheaf O satisfies the sheaf condition; this is the taken- 
for-granted fact that we can patch together different observations of space. 

Below are three views of the night sky. Given a telescope position to obtain the first 
view, one moves the telescope up and a little down to obtain the second and one moves 
it down and left to obtain the third. 
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These are value-assignments ai e 0(Vi),a2 e 0(V2)^ and as e 0{Vs) throughout subsets 
Vi,V2,Vs ^ X (respectively). These subsets Vi^V2^Vs cover some (strangely-shaped) 
subset U ^ X. The sheaf condition says that these three value-assignments glue together 
to form a single value-assignment throughout U : 




00 

Exercise 5.2.3.8. Find an application of sheaves in your own domain of expertise. 

Application 5.2.3.9. Suppose we have a sheaf for temperatures on earth. For every region 
U we have a set of theoretically possible temperature-assignments throughout U. For 
example we may know that if it is warm in Texas, warm in Arkansas, and warm in 
Kansas, then it cannot be cold in Oklahoma. With such a sheaf O in hand, one can 
use facts about the temperature in one region U to predict the temperature in another 
region V. 

The mathematics is as follows. Suppose given regions U^V ^ X and a subset A ^ 
0{U) corresponding to what we know about the temperature assignment throughout U. 
We take the following fiber product 

{pu,x)-HA) 0{X) ^ 0(V) 

J 

Pu,x 

A ^0(U) 

The image of the top map is a subset of OiV) telling us which temperature- assignments 
are possible throughout V given our knowledge A about the temperature throughout U. 

We can imagine the same type of prediction systems for other domains as well, such 
as the energy of various parts of a material. 00 

Example 5.2.3.10. In Exercises 4.2.4.3 and 4.2 A A we discussed the idea of laws being 
dictated or respected throughout a jurisdiction. If X is earth, to every jurisdiction 
U ^ X we assign the set 0{U) of laws that are dictated to hold throughout U. Given 
a law on U and a law on V, we can see if they amount to the same law on U nV. For 



5.2. CATEGORIES OF FUNCTORS 



201 



example, on /7 a law might say "no hunting near rivers" and on F a law might say "no 
hunting in public areas". It just so happens that onU nV all public areas are near rivers 
and vice versa, so the laws agree there. These laws patch together to form a single rule 
about hunting that is enforced throughout the union UvjV^ respected by all jurisdictions 
within it. 

5.2.3.11 Sheaf of ologged concepts 

Definition 5.2.3.5 defines what should be called a sheaf of sets. We can discuss sheaves 
of groups or even sheaves of categories. Here is an application of the latter. 

Recall the notion of simplicial complexes discussed in Section 2.7.4.3. They look like 
this: 




Given such a simplicial complex X, we can imagine each vertex v e Xq as an entity with 
a worldview (e.g. a person) and each simplex as the common worldview shared by its 
vertices. To model this, we will assign to each vertex v e X Siii olog corresponding 
to the worldview held by that entity, and to each simplex u e X^, we assign an olog 0{u) 
corresponding to a common ground worldview . . Recall that X is a subset of P(Xo); it is 
a preorder and its elements (the simplices) are ordered by inclusion. Ifu^v are simplices 
with u ^ V then we want a map of ologs (i.e. a schema morphism) 0{v) 0{u) 
corresponding to how any idea that is shared among the people in v is shared among 
the people in u. Thus we have a functor O: X ^ Sch (where we are forgetting the 
distinction between ologs and databases for notational convenience). 

To every simplicial complex (indeed every ordered set) one can associate a topological 
space; in fact we have a functor Alx: PrO Top, called the Alexandrov functor. 
Applying A/x(X°p) we have a space which we denote by A'. One can visualize A' as X, 
but the open sets include unions of simplices. There is a unique sheaf of categories on 
A' that behaves like X on simplices. 

How does this work in the case of our sheaf O of worldviews? For simplices such as 
(A) or (C/), the sheaf returns the olog corresponding to that person or shared worldview. 
But for open sets like the union of (CIJ) and (IJK)^ what we get is the olog consisting 
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of the types shared by C, /, and J for which / and J affirm agreement with types shared 
by /, J, and K. 

Example 5.2.3.12. Imagine two groups of people Gi and G2 each making observations 
about the world. Suppose that there is some overlap H = Gi nG2- Then it may happen 
that there is a conversation including Gi and G2 and both groups are talking about 
something and, although using different words, H says "you guys are talking about the 
same things, you just use different words." In this case there is an object-assignment 
throughout Gi u G2 that agrees with both those on Gi and those on G2- 



5.2.3.13 Time 



One can use sheaves to model objects in time; Goguen gave an approach to this in [Gog]. 
For another approach, let C be a database schema. The lifespan of information about the 
world is generally finite; that is, what was true yesterday is not always the case today. 
Thus we can associate to each interval U of time the information that we deem to hold 
throughout U. This is sometimes called the valid time of the data. 

If something is the case throughout U and we have a subset V ^ U then of course 
it is the case throughout V. And the sheaf condition holds too: if some information 
holds throughout U and some other information holds throughout U\ and if these two 
things restrict to the same information on the overlap U nV^ then they can be glued to 
information that holds throughout the union U u V. 

So we can model information-change over time by using a sheaf of C-sets on the 
topological space R. One way to think of this is simply as an instance on the schema 
C X Op(R)°P. The sheaf condition is just an added property that our instances have to 
obey. 

Example 5.2.3.14. Consider a hospital in which babies are born. In our scenario, mothers 
enter the hospital, babies are born, mothers and babies leave the hospital. Let C be the 
schema 



a baby 



was birthed by 



a mother 



Consider the 8-hour intervals 



Shifti 
Shifts 
Shifts 



(Jan 1 - 00 : 00, Jan 1 - 08 : 00), 
(Jan 1 - 04 : 00, Jan 1 - 12 : 00), 
(Jan 1 - 8 : 00, Jan 1 - 16 : 00). 



The nurses take shifts of 8 hours, overlapping with their predecessors by 4 hours, and they 
record in the database only patients that were there throughout their shift or throughout 
any overlapping shift. A mother might be in the hospital throughout shift 1, arriving 
before the new year. A baby is born at 05:00 on Jan 1, and thus does not make it into 
the Shifti -table, but does make it into the (Shifti n Shifts) -table. The two are there 
until 17:00 on Jan 1, and so they are recorded in the Shifts and Shifts tables. 

Whether or not this implementation of the sheaf semantics is most useful in practice 
is certainly debatable. But something like this could easily be useful as a semantics, i.e. 
a way of thinking about, the temporal nature of data. 
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5.3 Monads 

Monads would never have been invented without category theory, but they have been 
quite useful in formalizing algebra, calculating invariants of topological spaces, and 
imbedding non-functional operations into functional programming languages. We will 
mainly discuss monads in terms of how they can help us make explicit the context in 
which we're working, and in so doing relegate it to the background of our discussion. 
Much of the following material on monads is taken from [Sp3]. 



5.3.1 Monads formalize context 

Monads can formalize assumptions about the way one will do business throughout a 
domain. For example, suppose that we want to consider functions that do not have to 
return a value for all inputs. Such partial functions can be composed. Indeed, given 
a partial function f: A ^ B and a partial function g: B C, one gets a function 
g o f: A ^ C in diii obvious way. 

Here we are drawing arrows as though we are talking about functions, but there is an 
implicit context in which we are actually talking about partial functions. Monads allow 
us to write things in the "functional" way while holding the underlying context. What 
makes them useful is that the notion of context we are using here is made formal. 

Example 5.3.1.1. Partial functions can be modeled by ordinary functions, if we add a 
special "no answer" element to the codomain. That is, the set of partial functions A ^ B 
is in one-to-one correspondence with the set of ordinary functions A ^ B u {©}. For 
example, suppose we want to model the partial function f{x) := ^2_i : M ^ M in this 
way, we would use the function 



© ifx = -l, 
© ifx = l, 
^ ifx^O. 



An ordinary function f: A ^ B can be considered a partial function because we can 
compose with the inclusion 

B ^ Bu{©} (5.4) 



But how do we compose two partial functions written in this way? Suppose f : A ^ 
B u {Q} and g: B ^ C u {©} are functions. First form a new function 

g' := gu{©}: Bu{©} ^Cu {©} u {©} (5.5) 



then compose to get {g' o f): A^Cu{©}u{©}, and finally send both ©'s to the same 
element by composing with 

Cu{©}u{©}^Cu{©}. (5.6) 



What does this mean? Every element a e A is sent by / to either an element b e B 
or "no answer". If it has an answer /(a) g 5, this is either sent by g to an element 
g{f{a)) E C or to "no answer". We get a partial function A ^ C hy sending a to g{f{a)) 
if possible or to "no answer" if it gets stopped along the way. 
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Application 5.3.1.2. Experiments are supposed to be performed objectively, but suppose 
we imagine that changing the person who performs the experiment, say in psychology, 
may change the outcome. Let A be the set of experimenters, let X be the parameter space 
for the experimental variables (e.g. X = Age x Income) and let Y be the observation 
space (e.g. Y = propensity for violence). Then whereas we want to think of such an 
experiment as telling us about a function / : X ^ y, we may want to make some of the 
context explicit by including information about who performed the experiment. That is, 
we are really finding a function f : X x A ^ Y. 

However, it may be the case that even ascertaining someones age or income, which is 
done by asking that person, is subject to who in A is doing the asking, and so we again 
want to consider the experimenter as part of the equation. In this case, we can use a 
monad to hide the fact that everything in sight is assumed to be infiuenced by A. 

We will return to this in Examples 5.3.2.6 and 5.3.3.4. 

00 



5.3.2 Definition and examples 

What aspects of Example 5.3.1.1 are really about monads, and what aspects are just 
about partial functions in particular? It is a functor and a pair of natural transformations 
that showed up in (5.5), (5.4), and (5.6). In this section we will give the definition and 
a few examples. We will return to our story about how monads formalize context in 
Section 5.3.3. 

Definition 5.3.2.1. A monad on Set consists of some constituents (A. functor, B. unit 
map, C. multiplication map) conforming to some laws (1. unit laws, 2. associativity 
law), as follows: 

A. a functor T: Set Set, 

B. a natural transformation rj: idset and 

C. a natural transformation ji: T o T ^ T 

We sometimes refer to the functor T as though it were the whole monad; we call rj the 
unit map and we call /i the multiplication map. The following laws harmonize these 
constituents 

1. The following diagrams of functors Set Set commute: 

T o idset ^ ToT idset o T ^ ToT 




T T 



2. The following diagram of functors Set Set commutes: 



^ ^ ^ uoidT ^ ^ 

T oT oT — — -^ToT 



idx-o/i 

Y 

ToT 



-^T 
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Example 5.3.2.2. We now go through Definition 5.3.2.1 using what is called the List 
monad. The first step is to give a functor List: Set Set, which we did in Example 
4.1.2.15. Recall that if X = {p^q^r} then List(X) includes the empty list [ ], singleton 
lists, such as [p], and any other list of elements in X, such as [p^p^r^q^p]. Given a 
function / : X ^ F, one obtains a function List(/): List(X) List(y) by entry-wise 
application of /. 

As a monad, the functor List comes with two natural transformations, a unit map 
T] and a multiplication map /i. Given a set X, the unit map rjx • X ^ List(X) returns 
singleton lists as follows 



X 



rix 



■List(X) 



ph 



q^ 



r h 



[p] 

[q] 

[r] 



Given a set X, the multiplication map /ix • List(List(X)) List(X) flattens lists of lists 
as follows. 



List(List(X)) 



MX 



■ List(X) 



r],[],[q,r,p, r],[r]] h 



[q,p,r,q,r,p,r, r] 



The naturality of t] and /i just mean that these maps work appropriately well under 
term- by-term replacement by a function f : X . Finally the three monad laws from 
Definition 5.3.2.1 can be exemplified as follows: 



b, q^ q] h^^^^ [[p], [(7], [q\\ 



b, q^ q] 




b, q, q] H^^^^ [[p, q\] 



b. d 




[[b, M], [[], b ^, <!]]] [b, [rl [], b ^, ^]] 

idListo/i 

[b, r], b q^ q\] i ^ b. q] 

Exercise 5.3.2.3. Let P: Set Set be the powerset functor, so that given a function 
f : X ^ Y the function P(/) : P(X) ¥(Y) is given by taking images. 

a.) Make sense of the following statement: "with rj defined by singleton subsets and with 
fi defined by union, T := (P, is a monad". 
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b. ) With X = {a, 6}, write down the function rjx as a 2-row, 2-column table, and write 

down the function /ix as a 16-row, 2-column table (you can stop after 5 rows if you 
fully get it). 

c. ) Check that you believe the monad laws from Definition 5.3.2.1. 



Example 5.3.2.4. Here is the monad for partial functions. The functor T: Set Set 
sends a set X to the set X u {©}. Clearly, given a function f : X ^ Y there is an induced 
function / u{©}: X u {©} Y u so this is a functor. The natural transformation 
rj: id ^ T is given on a set X by the component function 

that includes X ^ X u {©}. Finally, the natural transformation /a: T o T ^ T is given 
on a set X by the component function 

/ix : X u {©} u {©} — > X u {©} 

that collapses both copies of ©. 

Exercise 5.3.2.5. Let £^ be a set, elements we will refer to as exceptions. We imagine 
that a function f : X ^ Y either outputs a value or one of these exceptions, which might 
be things like "overfiow!" or "division by zero!", etc. Let T: Set Set be the functor 
X ^ Xu£^. Follow Example 5.3.2.4 and come up with a unit map rj and a multiplication 
map fi for which (T, r^, fi) is a monad. 

Example 5.3.2.6. Fix a set A. LetT: Set Set be given by T(X) = X^ = Homset(^,^); 
this is a functor. For a set X, let rjx'- X ^ T{X) be given by the constant function, 
X ^ Cx'. A ^ X where Cx{a) = x for all a e A. To specify a function 

fix: Homset(AT(X)) ^ Homset(A,X), 

we curry and need a function 74xHomset(^, ^(^)) ~^ We have an evaluation function 
(see Exercise 2.7.2.5) ev: Ax Homset(^, ^(-^)) ^^i^)^ and we have an identity func- 
tion id^ : A ^ A, so we have a function (id^ xev): Ax Homset(^, ^(-^)) — ^ ^ ^ ^(-^)- 
Composing that with another evaluation function A x Homset(^, -^) X yields our 
desired fix- 

Remark 5.3.2.7. Monads can be defined on categories other than Set. In fact, for any 
category C one can take Definition 5.3.2.1 and replace every occurrence of Set with C 
and obtain the definition for monads on C. We have actually seen a monad (Paths, r^, fi) 
on the category Grph of graphs before, namely in Examples 4.3.1.5 and 4.3.1.6. That 
is. Paths: Grph Grph, which sends a graph to its paths-graph is the functor part. 
The unit map rj includes a graph into its paths graph using the observation that every 
arrow is a path of length 1. And the multiplication map ji concatenates paths of paths. 
The Kleisli category of this monad (see Definition 5.3.3.1) is used, e.g. in (4.14) to define 
morphisms of database schemas. 

5.3.3 Kleisli category of a monad 

Given a monad T := (T, 7^,/i), we can form a new category Kls(T). 
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Definition 5.3.3.1. Let T = (T, t^, /i) be a monad on Set. Form a new category, called 
the Kleisli category for J ^ denoted Kls(T), with sets as objects, Ob(Kls(T)) := Ob(Set), 
and with 

HomKis(T)(^,>^) Homset(X,T(r)) 

for sets X, Y . The identity morphism \dx : X ^ X in Kls(T) is given by 77: X ^ T{X) 
in Set. The composition of morphisms f. X and g: Y ^ Z in Kls(T) is given as 
follows. Writing them as functions, we have / : X ^ T(Y) and g: Y ^ T[Z). The first 
step is to apply the functor T to ^, giving T[g) : T{Y) T{T{Z)). Then compose with 
/ to get T{g) o /: X ^ T{T{Z)). Finally, compose with /i^: T{T{Z)) T{Z) to get 
the required function X T{Z). The associativity of this composition formula follows 
from the associativity law for monads. 

Example 5.3.3.2. Recall the monad T for partial functions, T(X) = X u {©}, from 
Example 5.3.2.4. The Kleisli category Kls(T) has sets as objects, but a morphism 
f: X ^ Y means a function X^Fuj©}, i.ea partial function. Given another 
morphism g: Y ^ Z, the composition formula in Kls(T) ensures that go f : X ^ Z has 
the appropriate behavior. 

Note how this monad allows us to make explicit our assumption that all functions 
are partial, and then hide it away from our notation. 

Remark 5.3.3.3. For any monad T = (T, 7^, /i) on Set, there is a functor i : Set Kls(T) 
given as follows. On objects we have Ob(Kls(T)) = Ob (Set), so take i = idob(Set)- 
Given a morphism / : X ^ F in Set, we need a morphism i{f) : X ^ F in Kls(T), i.e. 

a function i{f) : X T{Y). We assign i{f) to be the composite X ^Y ^ T(Y). The 
functoriality of this mapping follows from the unit law for monads. 

The point is that any ordinary function (morphism in Set) has an interpretation as 
a morphism in the Kleisli category of any monad. More categorically, there is a functor 
Set ^ Kls(T). 

Example 5.3.3.4. In this example we return to the setting laid out by Application 5.3.1.2 
where we had a set A of experimenters and assumed that the person doing the experiment 
may affect the outcome. We use the monad T = (T, 77, ja) from Example 5.3.2.6 and 
hope that Kls(T) will conform to our understanding of how to manage the affect of the 
experimenter on data. 

The objects of Kls(T) are ordinary sets, but a map / : X ^ F in Kls(T) is a function 
X Y^. By currying this is the same as a function X x A ^ F , as desired. To compose 
/ with g: Y ^ Z in Kls(T), we follow the formula. It turns out to be equivalent to the 
following. We have a function X x A ^ Y and a function Y x A ^ Z. Modifying the 
first slightly, we have a function X x A ^ Y x A^ hy identity on A, and we can now 
compose to get X x A ^ Z . 

What does this say in terms of experimenters affecting data gathering? It says that 
if we work within Kls(T) then we will be able to assume that the experimenter is being 
taken into account; all proposed functions X ^ Y are actually functions A x X ^ Y . 
The natural way to compose these experiments is that we only consider the data from 
one experiment to feed into another if the experimenter is the same in both experiments. 

8 

^This requirement seems a bit stringent, but it can be mitigated in a variety of ways. One such way 
is to notice that by Remark 5.3.3.3 that we have not added any requirement, because any old way of 
doing business yields a valid new way of doing business (we just say "every experimenter would get the 
same result"). Another way would be to hand off the experiment results to another person, who could 
carry it forward (see Example 5.3.3.8). 
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Exercise 5.3.3.5. In Exercise 5.3.2.3 we discussed the power set monad T = (P, 77,/i). 



a. ) Can you find a way to relate the morphisms in Kls(T) to relations? That is, given 

a morphism /: A ^ B in Kls(T), is there a natural way to associate to it a relation 
R<^AxB7 

b. ) How does the composition formula in Kls(T) relate to the composition of relations 

given in Definition 2.5.2.3? ^ 





Exercise 5.3.3.6. Let T = (P, 7^, /i) be the power set monad. The category Kls(T) is 
closed under binary products, i.e. every pair of objects A^ B e Ob(Kls(T)) have a 
product in Kls(T). What is the product of A = {1, 2, 3} and B = {a, b}7 

Exercise 5.3.3.7. Let T = (P, 7^, /i) be the power set monad. The category Kls(T) is 
closed under binary coproducts, i.e. every pair of objects A, B E Ob(Kls(T)) have a 
coproduct in Kls(T). What is the coproduct of A = {1, 2, 3} and B = {a, b}7 

Example 5.3.3.8. Suppose given a preorder A, which we speak of throughout this example 
as though it was the linear order given by time. There is a monad T = (T, 77, ja) that 
captures the idea that a function f.X^Y occurs in the context of time in the following 
sense: The output of / is determined not only by the element x E X on which it is applied 
but also by the time at which it was applied to x; and the output of / occurs at another 
time, which is not before the time of input. 

The functor part of the monad is given on X g Ob (Set) by 

T(X) = {p: A^ Ax X \ if p{a) = {a\x) then a' ^ a}. 

The unit r]x X ^ T[X) sends x to the function a ^ (a,x). The multiplication map 
jix'- T{T{X)) T{X) is roughly described as follows. If for every a e A you have a 
later element a' ^ a and a function p: A ^ A x X that takes elements of A to later 
elements of A and values of X, then p[a') is a still later element of A and a value of X, 
as desired. 

Morphisms in the Kleisli category Kls(T) can be curried to look like / : AxX AxY 
such that if f{a^x) = (a'^y) then a' ^ a. 

Example 5.3.3.9. Here we reconsider the image from the front cover of this book, repro- 
duced here. 



^Actually, Definition 2.5.2.3 is about composing spans, but a relation Q A x B is a kind of span, 
Ax B. 
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It looks like an olog, and all ologs are database schemas (see Sections. 5. 2. 14). But how 
is "analyzed by a person yields" a function from observations to hypotheses? The very 
name belies the fact that it is an invalid aspect in the sense of Section 2.3.2.1, because 
given an observation there may be more than one hypothesis yielded, corresponding to 
which person is doing the observing. In fact, all of the arrows in this diagram correspond 
to some hidden context involving people: the prediction is dependent on who analyzes 
the hypothesis, the specification of an experiment is dependent on who is motivated to 
specify it, and experiments may result in different observations by different observers. 

Without monads, the model of science proposed by this olog would be difficult to 
believe in. But by choosing a monad we can make explicit (and then hide from discourse) 
our implicit assumption that "of course everything is dependent on which human is doing 
the science". The choice of monad is an additional modeling choice. Do we want to 
incorporate the partial order of time? Do we want the scientist to be modified by each 
function (i.e. the person is changed when analyzing an observation to yield a hypothesis)? 
These are all interesting possibilities. 

One reasonable choice would be to use a simplification of the monad from Example 
5.3.3.8 in which the preorder A is the set of scientific models with the indiscrete preorder. 
This implies the following context: every morphism f : X ^ Y is really a morphism 
f : X X A ^ Y X A; while ostensibly giving a map from X to F, it is infiuenced by 
the scientific model under which it is performed, and its outcome yields a new scientific 
model. 

Reading the olog in this context might look like this: 

A hypothesis (in the presence of a scientific model) analyzed by a person 
produces a prediction (in the presence of a scientific model), which motivates 
the specification of an experiment (in the presence of a scientific model) , 
which when executed results in an observation (in the presence of a scientific 
model), which analyzed by a person yields a hypothesis (in the presence of a 
scientific model). 

The parenthetical statements can be removed if we assume them to always be around, 
which can be done using the monad above. 
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5.3.4 Monads in databases 



In this section we discuss how to record data in the presence of a monad. The idea is quite 
simple. Given a schema (category) C, an ordinary instance is a functor I : C ^ Set. But 
if T = (T, 7^, /i) is a monad, then a Kleisli T -instance on C is a functor J: C ^ Kls(T). 
Such a functor associates to every object c g Ob(C) a set J(c), and to every arrow 
/: c ^ in C a morphism J(/) : J(c) J{c') in Kls(T). How does this look in terms 
of tables? 

Recall that to represent an ordinary database instance I: C ^ Set, we use a tabular 
format in which every object c G Ob(C) is displayed as a table including one ID column 
and an additional column for every arrow emanating from c. In the ID column of table 
c were elements of the set /(c) and in the column assigned to some arrow /: c ^ c' the 
cells were elements of the set I{c'). 

To represent a Kleisli database instance J: C KlsT is similar; we again use a 
tabular format in which every object c e Ob(C) is displayed as a table including one ID 
column and an additional column for every arrow emanating from c. In the ID column 
of table c are again elements of the set J(c); however in the column assigned to some 
arrow /: c ^ are not elements of J(c') but T-values in J(c'), i.e. elements of T{J{d)). 

Example 5.3.4.1. Let T = (T^rj^ii) be the monad for partial functions, as discussed in 
Example 5.3.1.1. Given any schema C, we can represent a Kleisli T-instance I: C ^ 
Kls(T) in tabular format. To every object c g Ob(C) we'll have a set /(c) of rows, and 
given a column c ^ c' every row will produce either a value in I{c') or fail to produce 
a value; this is the essence of partial functions. We might denote the absence of a value 
using ©. 

Consider the schema indexing graphs 



C :-- 



Arrow 



src Vertex 



tgt 



As we discussed in Section 4.2.1.19, an ordinary instance on C represents a graph. 



/ : = 




Arrow (/) 


ID 


src 


tgt 


/ 


V 


w 


9 


w 


X 


h 


w 


X 



Vertex (/) 



ID 



A Kleisli T-instance on C represents graphs in which edges can fail to have a source 
vertex, fail to have a target vertex, or both. 



J: = 




Arrow (J) 


ID 


src 


tgt 


/ 


V 


w 


9 


w 


X 


h 


w 


X 


i 


V 


© 


3 


© 


© 



Vertex (J) 



ID 



w 



The context of these tables is that of partial functions, so we do not need a reference for 
© in the vertex table. Mathematically, the morphism J (src) : J( Arrow) J( Vert ex) 
needs to be a function J(Arrow) J(Vertex) u {©}, and it is. 
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5.3.4.2 Probability distributions 



Let [0, 1] ^ R denote the set of real numbers between and 1. Let X be a set and 
p: X ^ [0, 1] a function. We say that p is a finitary probability distribution on X if 
there exists a finite subset W X such that 



(5.7) 



ueW 



and such that p{x) > if and only x E W. Note that W is unique if it exists; we call 
it the support of p and denote it Supp(p). Note also that if X is a finite set then every 
function p satisfying (5.7) is a finitary probability distribution on X. 

For any set X, let Dist(X) denote the set of finitary probability distributions on X. It 
is easy to check that given a function f : X ^ Y one obtains a function Dist(/) : Dist(X) - 
Dist(y) by Dist(/)(y) = 2/(x)=2y^(^)- Thus we can consider Dist: Set Set as a 
functor, and in fact the functor part of a monad. Its unit rj: X ^ Dist(X) is given by the 
Kronecker delta function x ^ where Sx{x) = 1 and Sx{x^) = for x^ x. Its multipli- 
cation /i: Dist(Dist(X)) Dist(X) is given by weighted sum: given a finitary proba- 
bility distribution w: Dist(X) [0, 1] and x G X, put ii{w){x) = J]p^supp{w) ^{p)p{x)- 

Example 5.3.4.3 (Markov chains). Let Coop be the loop schema, 



Coop : = 



as in Example 3.5.2.9. A Dist-instance on Coop is equivalent to a time- homogeneous 
Markov chain. To be explicit, a functor 5: Coop KlsDist assigns to the unique object 
8 E Oh{Coop) di set S = S{s), which we call the state space, and to / : s ^ s a function 
S{f): S ^ Dist(S'), which sends each element x e S to some probability distribution on 
elements of 5*. For example, the table S on the left corresponds to the Markov matrix 
M on the right below: 




S:-- 



s 


ID 


f 


1 


.5(l)+.5(2) 


2 


1(2) 


3 


.7(l)+.3(3) 


4 


.4(l)+.3(2)+.3(4) 



M :-- 



0.5 0.5 
1 
0.7 

\ 0.4 0.3 







0.3 

0.3 



(5., 



As one might hope, for any natural number n G M the map : S ^ Dist(5') 
corresponds to the matrix M^, which sends an element in S to its probable location 
after n iterations of the transition map. 

Application 5.3.4.4. Every star emits a spectrum of light, which can be understood as 
a distribution on the electromagnetic spectrum. Given an object B on earth, different 
parts of B will absorb radiation at different rates. Thus B produces a function from the 
electromagnetic spectrum to distributions of energy absorption. In the context of the 
probability distributions monad, we can record data on the schema 



star emits wavelengths absorbed by B energies 
• ^ • ^ • 
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The composition formula for Kleisli categories is the desired one: to each star we associate 
the weighted sum of energy absorption rates over the set of wavelengths emitted by the 
star. 

00 



5.4 Operads 

In this section we briefly introduce operads, which are generalizations of categories. 
They often are useful for speaking about self-similarity of structure. For example, we 
will use them to model agents made up of smaller agents, or materials made up of smaller 
materials. This association with self-similarity is not really inherent in the definition, 
but it tends to emerge in our thinking about many operads used in practice. 

Let me begin with a warning. 

Warning 5.4.0.5. My use of the term operad is not entirely standard and conflicts with 
widespread usage. The more common term for what I am calling an operad is multicat- 
egory or colored operad. An operad classically is a multicategory with one object, and 
a colored operad is a multicategory. The analogy is that "operad is to multicategory as 
monoid is to category". The term multicategory stems from the fact that the morphisms 
in a multicategory have many, rather than one, input. But there is nothing really "multi" 
about the multicategory itself, only its morphisms. Probably the real reason though is 
that I find the term multicategory to be clunky and the term operad to be sleek, clocking 
in at half the syllables. I apologize if my break with standard terminology causes any 
confusion. 

This introduction to operads is quite short. One should see [Lei] for an excellent 
treatment. 



5.4.1 Definition and classical examples 

An operad is like a category in that it has objects, morphisms, and a composition formula, 
and it follows an identity law and an associativity law. The difference is that each 
morphism has many inputs (and one output). 




The description of composition in an operad is a bit heavier than it is in a category, but 
the idea fairly straightforward. Here is a picture of morphisms being composed. 
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Note that S and T disappear from the composition, but this is analogous to the way the 
middle object disappears from the composition of morphisms in a category 

r n I 1 

— - — > S — - — > X ^ the arrows to the left compose to give — — > X ^ 

Here is the definition. 

Definition 5.4.1.1. An operad O consists of some constituents (A. objects, B. mor- 
phisms, C. identities, D. compositions) conforming to some laws (1. identity law, 2. 
associativity law), as follows: 

A. A collection Ob((9), elements of which are called objects. 

B. For each natural number n g M and sequence (xi, . . . , x^, y) of objects in 
Ob((9), a set Homc)(xi, . . . , x^; y) e Set. Its elements are called morphisms 
from xi,...,Xn to y. 

C. For every object x e Ob(C), a specified morphism denoted id^^ e }iomo{x;x) 
called the identity morphism on x. 

D. For every natural number n G M, natural numbers mi, . . . , , g M, sequences 
(^1,1,^1,2, ... ,^i,mi,yi) through (xn,i,Xn,2, . . . ,^n,m^,yn), and element z (ah 
in Ob(0)), a function 

n 

o: Homc)(yi,...,yn;^) x ]^Homc)( 

i=l 

> Homc)(xi,i, Xi,2, . . . , Xn,mr,;z), 
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called the composition formula. 

Given objects xi, . . . , x^, g Ob(C), we can denote a morphism / e Homc(xi, . . . , x^; ?/) 
by /: y] we say that each Xi is a domain object of / and that y is 

the codomain object of /. We use infix notation for the composition formula, such as 

^o(/l,...,/n). 

Here are the laws that harmonize the above constituents: 

1. for every xi, . . . , x^-, y £ Ob(C) and every morphism / : (xi, . . . , x^) we have 

/o (id^,,...,ida,^) = / and id^ o / = /; 

and; 

2. an appropriate associative law (see [Lei, Definition 2.1.1]). 

Example 5.4.1.2. Let Sets denote the operad defined as follows. For objects we put 
Ob(Sets) = Ob(Set). For a natural number n g M and sets Xi, . . . , X^, put 

Homsets(^i,...,^n;^) '= Homset(^i X ••• X Xn,Y). 

Given functions /i : (Xi,i x • • • x Xi,^J Yi through : (Xn,i X • • • X Xfi^rnn) ~^ Yn 
and a function Yi x • • • x ^ Z, the universal property provides us a unique function 
of the form (Xi^i x • • • x X^^^^) — > Z, giving rise to our composition formula. 

Example 5.4.1.3. An operad commonly used in mathematics is called the little n-cubes 
operad. We'll focus on n = 2 and talk about the little squares operad O. Here the set 
of objects consists of only one object, Ob(0) = {□}, which we denote by a square. For 
a natural number n E M, a morphism / : (□,□,...,□) — > □ is a positioning of n non- 
overlapping squares inside of a square. Here is a picture of a morphism (Xi , X2, X3) F, 
where Xi = X2 = X3 = Y = a. 



Y 




The composition law says that given a positioning of small squares inside a large square, 
and given a positioning of tiny squares inside each of those small squares, we get a 
positioning of tiny squares inside a large square. A picture is shown in Figure 5.9. 
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X2 


X3 


□ 




□ 




□ 




□ 






□ 




□ 







o 





Figure 5.9: Here we show a morphism {Xi^ Xs) Y and morphisms 1^1,2) 

(^2,1, ^2,2, ^2,3) -^2, and (1^3,1) X3, each of which is a positioning of squares 
inside a square. The composition law scales and positions the squares in the "obvious" 
way. 

Hopefully, what we meant by "self-similarity" in the introduction to this section (see 
page 212) is becoming clear. 

Exercise 5.4.1.4. Consider an operad O like the little squares operad from Example 
5.4.1.3, except with three objects: square, circle, equilateral triangle. A morphism is 
again a positioning of shapes inside of a shape. 

a. ) Draw an example of a morphism / from two circles and a square to a triangle. 

b. ) Find three other morphisms that compose into /, and draw the composite. 



5.4.1.5 Operads: morphisms and algebras 

If operads are like categories, then we can define things like functors and call such a 
thing a morphism of operads (like a functor is a morphism of categories). We sketch the 
definition here. 
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Definition 5.4.1.6. Let O and O' be operads. A morphism of operads from O to O' ^ 
denoted F: O ^ O' consists of some constituents (A. on-objects part, B. on-morphisms 
part) conforming to some laws (1. preservation of identities, 2. preservation of compo- 
sition), as follows: 

A. a function Ob(F) : Ob(0) ^ Oh{0')] and 

B. for every sequence of objects xi, . . . , x^, ^ e Ob(0), a function 

YLompixi, . . . ,Xn;y): Homc>(xi, ...,Xn',y)^ Homc>/(F(c), F(d)), 
which we sometimes denote simply by F: Homc(c, d) Hom^/ (F(c), F((i)). 
The following laws harmonize the above constituents: 

1. Identities are preserved by F. That is, for any object c e Ob(C), we have 
F(idc) = idi?(c); 

2. Composition is preserved by F. That is, for any objects bij^Ci^d e Ob(C) and 
morphisms gi : bij Ci and hi'. Ci ^ we have F{h ^ Qi) = F{h) o F{gi). 

Exercise 5.4.1.7. Let O denote the little squares operad from Example 5.4.1.3 and let 
O' denote the operad you constructed in Exercise 5.4.1.4. 

a. ) Can you come up with an operad morphism O ^ O'l 

b. ) What about an operad morphism O' Ol 



Definition 5.4.1.8. Let O be an operad. An algebra on O is a morphism of operads 
A: O Sets. 

Remark 5.4.1.9. Every category can be construed as an operad (yes, there is a functor 
Cat Oprd), by simply not including non-unary morphisms. That is, given a category 
C, one makes an operad O with Ob(0) := Ob(C) and with 



Just like a schema is a category presentation, it is possible to discuss operad presentations 
by generators and relations. Under this analogy, an algebra on an operad corresponds 
to an instance on a schema. 

5.4.2 Applications of operads and their algebras 

Hierarchical structures may be well- modeled by operads. Describing such structures 
using operads and their algebras allows one to make appropriate distinctions between 
different types of thinking. For example, the allowable formations are encoded in the 
operad, whereas the elements that will fit into those formations are encoded in the 
algebra. Morphisms of algebras are high-level understandings of how elements of very 
different types (such as materials vs. numbers) can occupy the same place in the structure 
and be compared. We will give examples below. 

■"^^Here is how the indexing works. Choose n G N, and for each i E n choose rrii G N. Then for each 
i E n we have a morphism gi : {bi^i, . . . , bi^rni) Q and we also have a morphism h: (ci, . . . , Cn) d. 
The law here is that F{ho (gi,. . [ , gn)) ^ F\h) o . . .,F(gn)). 
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Application 5.4.2.1. Every material is composed of constituent materials, arranged in 
certain patterns. (In case the material is "pure", we consider the material to consist 
of itself as the sole constituent.) Each of these constituent materials each is itself an 
arrangement of constituent materials. Thus we see a kind of self-similarity which we can 
model with operads. 



(5.10) 

For example, a tendon is made of collagen fibers that are assembled in series and 
then in parallel, in a specific way. Each collagen fibre is made of collagen fibrils that are 
again assembled in series and then in parallel, with slightly different specifications. We 
can continue down, perhaps indefinitely, though our resolution fails at some point. A 
collagen fibril is made up of tropocollagen collagen molecules, which are twisted ropes 
of collagen molecules, etc.^^ 

Here is how operads might be employed. We want the same operad to model both 
actual materials, theoretical materials, and functional properties; that is we want more 
than one algebra on the same operad. 

The operad O should abstractly model the structure, but not the substance being 
structured. Imagine that each of the shapes (including the background "shape") in 
Dia gram (5.10) is a place-holder, saying something like ''''your material here^\ Each 
morphism (that's what (5.10) is a picture of) represents a construction of a material out 
of parts. In our picture, it appears we are only concerned with the spacial arrangements, 
but there is far more flexibility than that. Whether we want to allow for additional 
details beyond spacial arrangements is the kinds of choice we make in a meeting called 
"what operad should we use?" 

00 

Application 5.4.2.2. Suppose we have chosen an operad O to model the structure of 
materials. Each object of O might correspond to a certain quality of material, and each 
morphism corresponds to an arrangement of various qualities to form a new quality. An 
algebra A: O ^ Sets on O forces us to choose what substances will fill in for these 
qualities. For every object x e Ob((9), we want a set A{x) which will be the set of 

-'^-'^ Thanks to Sandra Shefelbine for explaining the hierarchical nature of collagen to me. Any errors 
are my own. 
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materials with that quahty. For every arrangement, i.e. morphism, /: (xi, . . . ,Xn) 
and every choice ai e . . . e A{xn) of materials, we need to understand what 

material = A(/)(ai, . . . , a^) e A{y) will emerge when these materials are arranged in 
accordance with /. We are really pinning ourselves down here. 

But there may be more than one interesting algebra on O. Suppose that B : O ^ Sets 
is an algebra of strengths rather than materials. For each object x e Ob((9), which 
represents some quality, we let B{x) be the set of possible strengths that something of 
quality x can have. Then for each arrangement, i.e. morphism, /: 
and every choice bi E 5 (x i ),..., 6^ ^ B{xn) of strengths, we need to understand what 
strength h' = . . . , 6^) g B(y) will emerge when these strengths are arranged in 

accordance with /. Certainly an impressive achievement! 

Finally, a morphism of algebras S: A ^ B would consist of a coherent system for 
assigning to each material a e A{X) of a given quality x a specific strength S{a) e B{X), 
in such a way that morphisms behaved appropriately. In this language we have stated a 
very precise goal for the field of material mechanics. 

00 

5.4.2.3 Wiring diagrams 

Example 5.4.2.4. Here we describe an operad of relations^ which we will denote by IZ. 
The objects are sets, Ob(7^) = Ob(Set). A morphism /: (xi, X2, . . . , Xn) — ^ in 1Z is 
a diagram in Set of the form 




(5.11) 



such that the induced function R — > (xi x X2 x • • • x x x^) is an injection. 

We use a composition formula similar to that in Definition 2.5.2.3. Namely, we form 
a fiber product 



FP 



n.enn 




One can show that the induces function FP — > [Ylien Yljenn j x is an injection, so 
we have a valid composition formula. Finally, the associativity and identity laws hold. 

12 

Application 5.4.2.5. Suppose we are trying to model life in the following way. We define 
an entity as a set of phenomena, but in order to use colloquial language we say the entity 
is able to experience that set of phenomena. We also want to be able to put entities 

■"^^Technically we need to use isomorphism classes of cone points, but we don't worry about this here. 
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together to form a super-entity, so we have a notion of morphism /: (ei, . . . , e^) — ^ e' 
defined as a relation as in (5.11). The idea is that the morphism / is a way of translating 
between the phenomena that may be experienced by the sub-entities and the phenomena 
that may be experienced by the super-entity. 

The operad IZ from Example 5.4.2.4 becomes useful as a language for discussing issues 
in this domain. 00 

Example 5.4.2.6. Let IZ be the operad of relations from Example 5.4.2.4. Consider the 
algebra S:1Z ^ Sets given by S{x) = P(x). Given a morphism Yli^i ^ ^ V ^"^^ 
subsets Xi^ we have a subset y\ - x\ ^ Hi We take the fiber product 



FP 




and the image of FP ^ y is a subset of y. 

Application 5.4.2.7. Following Application 5.4.2.5 we can use Example 5.4.2.6 as a model 
of survival. Each entity survives only for a subset of the phenomena that it can expe- 
rience. Under this interpretation, the algebra from Example 5.4.2.6 defines survival as 
the survival of all parts. That is, suppose that we understand how a super-entity is 
composed of sub-entities in the sense that we have a translation between the set of phe- 
nomena that may be experienced across the sub-entities and the set of phenomena that 
may be experienced by the super-entity. Then the super-entity will survive exactly those 
phenomena which translate to phenomena for which each sub-entity desires. 

Perhaps a better term than survival would be "allowance". A bureaucracy consists of 
a set of smaller bureaucracies, each of which allows certain phenomena to pass; the whole 
bureaucracy allows something to pass if and only if, when translated to the perspective 
of each sub-bureaucracy, it is allowed to pass there. 

00 

Example 5.4.2.8. In this example we discuss wiring diagrams that look like this: 




The operad in question will be denoted W. Its objects are pairs (C, s) where C is a finite 
set and v: C ^ Ob(Set) is a function. Think of such an object as a circle with C-many 
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cables sticking out of it; each cable c is assigned a set v{c) corresponding to the set of 
values that can be carried on that cable. For example E2 = (C^v) where \C\ = 11 and 
we consider v to be specified by declaring that black wires carry Z and red wires carry 
{sweet, sour, salty, bitter, umami}. 

The morphisms in W will be pictures as above, formalized as follows. Given ob- 
jects (Ci, vi), . . . , (Cn, Vn), {D,w), a morphism F: ((Ci, vi), . . . , (Cn, ^n)) — > {D,w) is 
a commutative diagram of sets 




Ob(Set) 



such that i and j are jointly surjective. 

Composition of morphisms is easily understood in pictures: given wiring diagrams 
inside of wiring diagrams, we can throw away the intermediary circles. In terms of sets, 
we perform a pushout. 

There is a morphism of operads W ^ S given by sending iC^v) to ncec^(^)- 
idea is that to an entity defined as having a bunch of cables carrying variables, a phe- 
nomenon is the same thing as a choice of value on each cable. A wiring diagram translates 
between values experienced locally and values experienced globally. 

Application 5.4.2.9. In cognitive neuroscience or in economics, it may be that we want to 
understand the behavior of an entity such as a mind or a society in terms of its structure. 
Knowing the connection pattern (connectome, supply chain) of sub-entities should help 
us understand how big changes are generated from small ones. 

Under the morphism W ^ S the algebra S Sets from Application 5.4.2.7 becomes 
an algebra W Sets. To each entity we now associate some subset of the value- 
assignments it can carry. 00 

Application 5.4.2.10. In [RS], Radul and Sussman discuss propagator networks. These 
can presumably be understood in terms of wiring diagrams and their algebra of relations. 

00 



■"^^If one is concerned with cardinality issues, fix a cardinality k, and replace Ob(Set) everywhere with 
Ob(Set<«,). 
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